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Pneumococcal histidine triad protein E precursor 1039 
STRPN (Hypothetical AA 

protein SP1004) [phtE] [Streptococcus pneumoniae] align 




Score = 2016 bits (5224), Expect =0.0 

Identities = 1004/1039 (96%), Positives = 1004/1039 (96%) 

Query : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MKFS KK Y I AAGSAVT V S L3 LCAYALNQHRS QENKDMKR V3 Y QS 3 QXS 'EN LT PDQV'S 
Sbjct : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 



query: 61 qkegiqaeqivikitdqgyvtshgdhyhyyngkvpydalfseellmkdpnyqlkdadivn 120 

qke gi qaeq 1 v i kl tdqgy vt 3 hgdk yhy yngkvpy dal fs e e llmkdpn yqlkdadi vn 

sbjct: 61 qkegiqaeqivikitdqgyvtshgdhyhyyngkvpydalfseellmkdpnyqlkdadivn 120 

query: 121 evkggyiikvdgkyyvylkdaahadnvrtkdeinrqkqehvkdnekvnsbfvavarsqgry 180 

evkggy 1 1 1'o/dgky yv y i^<daahai>nvrtkde i nrqkoe hvk.dne nvi\var s qgr y 

sbjct: 121 e vkggy 1 1 kvdgky yvylkdaahadnvrtkde i nrqkqehvkdne kvnsnvavarsqgr y 180 

query: 181 ttndgyvfnpadiiedtgnayivphgghyhyipxxxxxxxxxxxxxxxxxxxnmqpsqls 240 

ttndgyvfnpadii^dtgnayivphgghyhyxp nmftpsqls 

sbjct: 181 ttndgyvfnpadiiedtgnayivphgghyhyipksdlsaseltw^kahlagknmqpsqls 240 

query: 241 ysstasdnntqsvakgstskpanksenlqsllkelydspsaqrysesdglvfdpakiisr 300 

ys staftvonctqs vakgst8kpankse^^:jqs llke lyds ps a.q£ ys e sdgl vfdfakt 1 3 y< 

sbjct: 241 ysstasdnntqsvakgstskpanksenlqsllkelydspsaqrysesdglvfdpakiisr 300 

query: 301 tpngvaiphgdhyhfipysklsaleekiarmvpisgtgstvstnakpnewxxxxxxxxx 360 

tpngvaiphgdkyhfi?ysklsalei>ki;\kmvpisgtg3tvstn;\kpnevv 

sbjct: 301 tpngvaiphgdhyhfipysklsaleekiarmvpisgtgstvstnakpnewsslgslssn 360 

query: 361 xxxxxxxkelssasdgyifnpkdiveetatayivrhgdhfhyipksnqigqptlpnnsla 42 0 

ke ls s asdgy i fnpkdi ve etatay i vrhgdhfhy i pksnqi gq ptl ?nns la 

sbjct: 361 psslttskelssasdgyifnpkdiveetatayivrhgdhfhyipksnqigqptlpnnsla 42 0 

query: 421 tpspslpinpgtshekheedgygfdanriiaedesgfvmshgdhnhyffkkdlteeqika 48 0 

t p 3 ?s i: ? i n pgt s he khf; e d gy gfdanr 1 1 ae dp; 3 g fvms h gdi-inky f fkkd t.:t e e q i ka 

sbjct : 421 tpspslpinpgtshekheedgygfdanriiaedesgfvmshgdhnhyffkkdlteeqika 480 

query: 481 aqkhleevktshngldslssheqdypsnakemkdldkkieekiagimkqygvkresiwn 540 

aqkhl££vktshnglpsi;ssh£qdypsn^^ 

sbjct : 481 aqkhleevktshngldslssheqdypsnakemkdldkkieekiagimkqygvkresiwn 540 

query: 541 keknaiiyphgdhhhadpidehkpvgighshsnyelfkpeegvakkegnkvytgeeltnv 600 

K£K>;AI I x PKGDKKHADP VTjKHKPVGX GHSHSN YE L'FKPE EGVAKKPiGNKV YTGEP. LTN V 

Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VNLLKNStFNNQNFTLANGQKRVSF 

Sbjct : 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

Z GVGN X AM EE LDQ P Y L ? GQ'T FK Y T I A.S KD Y PE V S Y D GT FT V PT S LAY KMAS QT X F Y ? FH A. 

Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

Query : 721 GDT Y LRVN PQ FAV PKGTDALVRV FDE FHGNAYLENNYKVGE I KLPI PKLNQGTTRTAGNK 780 

GOT Y LRVN FQ FAV PKGTDALVRVFDS FKGN A Y LS NN Y?KVGE X K L P I PKLNQG7 T RT AGNK 

Sbjct : 721 GDTYLRVNPQFAV PKGTDALVRV FDE FHGNAYLENNYKVGE I KLPI PKLNQGTTRTAGNK 780 
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Query: 7 81 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

I FVT FMA N AY LDNQST Y I VEVPT. LFJvENQTDKFS X L PQ FKRNKAQSNLKLDE KYE E ?KTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Sbjct: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

S GE V 'X KKTxMAO FT OK A PQGNGF.N KPBKN GKV'S T GT VKN QPTSNKPADS L APNK KP v'K P 
Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 102 0 

STDN GM'LN PF. GN VGS O PMLD PALS n. A PAVDPYQF. KLE K'FTAS YGLCy L DSV .1 FMMDGT .T. 
Sbjct : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

£LRLPSG£VIKKML3DLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 



tr Q3DQ07 Pneumococcal histidine triad protein E [phtE] 1039 

Q8DQ07_STRR6 [Streptococcus AA 

pneumoniae (strain ATCC BAA-255 / R6) ] align 



Score = 2006 bits (5196), Expect =0.0 

Identities = 998/1039 (96%), Positives = 1000/1039 (96%) 

Query: 1 MKFSKKY IAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MKFSKKY I AAGS AVI VS 1*3 LCAY7\LNQHP. 3QENKDNNRVS YVDGSQ3 SIQKSFNLT PDOVS 
Sb j ct : 1 MKFSKKY I AAGS AVI VSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

QKE SI OAF- Q I V I K I TDQ G Y VT S H GDH Y H Y YNGKV P Y DAI: F S 3 E L LMKD FN Y Q LKD A D I VN 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

Sbjct: 121 EVKGGY 1 1 KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRY 180 

Que ry : 181 TTNDGYVFNPADI IEDTGNAYI VPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

TTNDGYVFNPADI IEDTGNAY IV PHGGHYHY I P NKQPSQLS 
Sbjct: 181 TTNDGYVFNPADI I EDTGNAY I VPHGGHYHY I PKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Y 3 3 17**3 DNN T Q S VAKGS T 3 K PANKS E N LQ 3 L LXS L Y O S P 3 AQR Y S £ 3 D GLV FD PAK 1 1 S R 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Que ry : 301 TPNGVAI PHGDHYHFI PYSKLSALEEKIARMVPI SGTGSTVSTNAKPNEWXXXXXXXXX 360 

TPNGVAI PHGDHYHFIPYSKLSALEEKIAR VPI.SGTG3TVSTNAKPNSW 
Sbjct: 301 TPNGVAI PHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 42 0 

KF; L3 S7\S DGY I FN PKDI V£E TAT AY I VRKGDH FH Y I PKSNQI GQPT LPNNS LA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 
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Query : 421 TPSPSLPINPGTSHEKHEEDGYGFDANRI IAEDESGFVMSHGDHNHYFFKKDLTEEQIKA 480 

TF3PSLPINFG S HE KH E E DG Y GFDANR X I AE DE 3 GF* MS H G*:*KN?I Y F FKKDLT HI E Q I KA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKKLE E V KT 3 R'NGLDS 1:53 S HE Q D Y P 1^AKE^KDa:DKKXE^KIAGIHKQYGVK^E3IVV7^ 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

K' if; KN A.Y I V PKGDHKR AD P I HK'PVGi GHS HS N YE L FK PP. £ GVAKK £ GNK V Y T G £ F. LTN V 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VN LLKN3 T FNN QNFT T. AN GQKR VS FS FP PS LEKKLG 1 NM7.iV KL .T. T PDGKV LP. KV3 GK.V PG 
Sbjct: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

E GVGN 1" AN FE LDQ F Y L P GQx FKY T I A3 KD Y PE VS Y D GT FT V P T 3 LAY KMA8 QT 1 F Y P Fi£A 
Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

GDT Y LR VN PQ FAY PKG7 DALVRV FDS FH GN A Y LE NM Y KVGE I KT.t ? I PK LNQ GT T R.T AGNK 
Sbjct : 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

Query: 7 81 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

I PVTFMANAY LDNQ3T Y I VEVP I LSKSNQTDKPS J. LPQFKRNKAQEN KLDEK VEE PKTS 
Sbjct: 7 81 I PVTFMANAYLDNQSTYI VEVPI LEKENQTDKPS I LPQFKRNKAQENSKLDEKVEE PKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSKSTLSSY PT VDPVQEKVAKFA^S V GMKLENYLrNMDGTIFLYL? 
Sbjct: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 9 60 

S GE V I KKNMAT- FT GE APQ GNGE NKP 3 E NG K VS T GTVE NQ PT Hi NK PAD 3 L P E APNE K PV K P 
Sbjct : 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 102 0 

ENS TDN GMLN PE GN VGS D PMLD PALE E APAV D PVQE KLE K FT AS Y GLG L D 3 V I FNHD GT I 
Sbjct : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 102 0 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 




Surface protein BVH-3 
TRPN pneumoniae] 



[bvh-3 ] [ Streptococcus 



1039 
AA 

align 



Score = 2006 bits (5196), Expect =0.0 

Identities = 998/1039 (96%), Positives = 1000/1039 (96%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK F5 KK Y I AAGS7WI VS L3 T.tCA Y ALNQHRSQENKDNNRVS YVDGS OS 3 QK3 ENLT PDQVS 
Sb j ct : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 
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Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 
Sbjct : 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

Query: 121 EVKGGYIIK^DGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

E VKGGY 1 1 KVDGK Y Y V Y LKDAAIii U5N VP.T KDE 1 NR QKQE KVKDN E KVNS N VAV AX< S Q G R Y 
Sbjct: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNS^^^AVARSQGRY 180 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

TTNDGYVFNPiU^IlEDTGNAYlVPHGGHYHYIP NMQPSQLS 
Sbjct: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Y S S T A3 DNNTQ S VAKGS T S K PANKS E NLQ 3 L LKE Ij Y D S P S7\OR Y S E 3 DGLV FD F AK 1 1 S R 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEWXXXXXXXXX 3 60 

TPNGV.AIPHGDHYHFIPY3KLSALEEKIAR VPI3GTGSTVSTNAKFNEVV 
Sbjct: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

K^LSSABDGYIFNPKDIV^ETATAVIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKA 480 

TPSPSLPINPG S HE KKE E D GY G FDANR 1 1' AE DE S GF -f-MS HG * HNR Y if r KKD LTE E Q 1 KA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 4 80 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKKLEEVKTSHKGLDSLSSKEQDYP N7\KEMKDLDKKI E EKI AGIMKQY GVKRE S I VVK 
. Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KBKNAJIYPHGDHHIIADPIDEHKPV^^ 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VmLKNSTFNNQN FT LA£fGQKRVSF^^ 
Sbjct: 601 VNLLKNSTFNNQNFTIjANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

SGYGNIA>;FELDQPYLPGQ?FKYT^^ 
Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 7 80 

GDTYLRV1\ T PQFAVPKGTDA^ 
Sbjct: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 7 80 

Query: 7 81 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

I PVT FMAN AY LDNQ STYX VE V PILE KENQT DK P 3 I L ?Q FKR NKAQE N KLDEKVEEPKTS 
Sbjct: 7 81 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

E KV'F: KE K T.i S S T GN S T 3 N S T LE E V PT VD PVQE KV AK F AE 3 Y GMKLENV T, Ft^ID G'T I E I: Y Ij P 
Sbjct: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 
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S GZ V I KKTsMAA; FT GP A PQGN GF.N K'PSE.N GKVS T GT V £ N Q PTEN K PAD.S L PE APN P KP v'KP 
Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENS TDNGMLN PE GNVGS D PMLD PALE E APAVD PVQE KLE K FTAS Y GLGLDS V I FNMDGT I 1020 

SNSTDNGMLNPEGNVGSDPMI^ 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

SLRLPSGEVIKKML3DLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 



tr Q6WNQ5 Surface protein BVH-3 (Fragment) [bvh-3] [Streptococcus 1019 

Q 6WNQ 5_S T R PN pneumpni ae ] AA 

align 

Score = 1974 bits (5115), Expect = 0.0 

Identities = 981/1019 (96%), Positives = 981/1019 (96%) 

Query : 2 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 8 0 

CAYALNQKRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIC^QIVIKITDQGYV 
Sbjct : 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

Query: 81 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 140 

T 3 H GDK Y I { Y Y NGKV ? Y DAL F £ E Ei LLMKD PHY Q LKDAD I VNE VKGG Y 1 I KVDGK Y YV Y LKD 
Sbjct : 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 120 

Query: 141 AAHADWRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 200 

A AHA DN VRT KX>E X NR QKQP H'V'KDN K KV7x S N VAV7 VR53 Q G.R Y T TND G Y V FN PAD .X I E DT GNA 
Sbjct: 121 AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 180 

Query : 2 01 YIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNMQPSQLS YSSTASDNNTQSVAKGSTSK 2 60 

YXVPK GHYHYT? NMQ PS QLS '{. S S T AS DNN T QS V AKG S T 8 K 

Sbjct: 181 YI VPHRGHYHY I PKSDLSASELA7U\KAHLAGKNMQPSQLS YSSTASDNNTQSVAKGSTSK 240 

Query: 261 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAI PHGDHYHFIPYSK 32 0 

PANKS E NLQS LLKE L YDS PS AQR Y S E 3 DG L V FD PAKI I SRT PNSVAI PHGDK Y H FX P Y SK 
Sbjct: 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAI PHGDHYHFIPYSK 300 

Query: 321 LSALEEKIARMVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFN 380 

I:S ALEE KI ARI v TV?I 3 GT GS TVS TNAXPNE VV KELSSASDGYIFN 
Sbjct: 301 LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 360 

Query: 381 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 440 

F KD I VFi E TAT AY I VRH GD.H FK Y I PK3NQ I GQ ?T L PMNS I -AT F 3 ? 5i I: P X N F GT £ HE KHE E D 
Sbjct: 361 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420 

Query: 441 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 500 

GY G STDAJN'R .1" A" AE DE 3 GPV>: S H GDHN H Y E V'KKDXiT h- E Q X KAAQK'K LE E V KT S HNGX.D £ L S 53 
Sbjct: 421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

Query: 501 HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 560 

HEQDYPoNAKE^KDLDKKI^H;KXAGXMKQYGVKKE3XVV7^KEKNAI 1 V PHGDHHHADPXD 
Sbjct: 481 HEQDY PSNAKEMKDLDKKI EEKIAGIMKQYGVKRES I WNKEKNAI I Y PHGDHHHADPID 540 

Query: 561 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTLANGQ 620 
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Sbjct: 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTK\A/NLLKNSTF^QNFTLANGQ 600 

Query: 621 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 680 

K:<VSFS FPPELFXKLOT 

Sbjct : 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

Query: 681 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 740 

FKY T I A3 KD Y PE V 5 Y DGT FT V FTS LAYKMA3QT 1 FY PFIiAGDT Y LKV NPQ FAV PKGTDAL 
Sbjct: 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

Query: 741 VRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFM7\NAYLDNQSTYIVE 8 00 

VP. VFDE FHGNAY LS T#r?ri , CVGE I KL ? I PKLNQGTTRTAGNK I FVT FKANAY LDNOS T Y I VE 
Sbjct: 721 VRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE 780 

Query: 8 01 VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 8 60 

V PI L&KENQTDKPS X L ??QFKRNKAQF.N" K DEKV^^PKTSEKVTnK^KLS^TGNSTSNSTL 
Sbjct: 781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

Query: 8 61 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQG 92 0 

E £ V PT V .0 PVQE K V AKF AE S Y GMK'LE N V L PNMDG T I E L Y L p SGEVIKK^VfADFTGPAPQGN 
Sbjct: 841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 900 

Query : 921 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 98 0 

GENKPS EN GKVS T GT VEN Q ?T £ NKPADS L PEAPN £ KPVKPEN 3TDNGMLN PE GN VGS D PM 
Sbjct: 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

Query: 981 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1039 

LD PALE E APAVD PV QE KLH! K FT AS Y GL GLD3 V I F N'MD GT I E LR L P S GE V 1 KKM L3 D L 1 A 
Sbjct: 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1019 
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ABSTRACT 



, A vaccine composition is disclosed that comprises polypep- 
tides and fragments of polypeptides containing histidine 
triad residues or coiled -coil regions, some of which polypep- 
tides or fragments lie between 80 and 680 residues in length. 
Also disclosed are processes for preventing infection caused 
by S. pneumoniae comprising administering of vaccine 
compositions. 
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ABSTRACT 



The invention is directed to isolated polypeptides bearing 
sequence homology to the Sp36 protein found in pneumo- 
coccal organisms, such as Streptococcus pneumoniae. Poly- 
nucleotides encoding such polypeptides are also disclosed. 
The invention also relates to antibodies specific for the 
disclosed polypeptides and to uses of such antibodies in the 
treatment of diseases caused by staphylococci as well as 
group A and B streptococci. In addition, the invention relates 
to the use of the disclosed polypeptides in compositions and 
as vaccines and for prophylactic uses such as in vaccination 
of animals, especially humans, against a wide variety of 
streptococcal, staphylococcal and other diseases. 
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ABSTRACT: 

The invention is directed to isolated polypeptides bearing sequence homology to the Sp36 protein found 
in pneumococcal organisms, such as Streptococcus pneumoniae. Polynucleotides encoding such 
polypeptides are also disclosed. The invention also relates to antibodies specific for the disclosed 
polypeptides and to uses of such antibodies in the treatment of diseases caused by staphylococci as well 
as group A and B streptococci. In addition, the invention relates to the use of the disclosed polypeptides 
in compositions and as vaccines and for prophylactic uses such as in vaccination of animals, especially 
humans, against a wide variety of streptococcal, staphylococcal and other diseases. 
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CLAIMS: 

What is claimed is: 



1. An isolated polypeptide comprising an amino acid sequence with at least 95% 
sequence identity to the sequence of SEQ ID NO: 4 and wherein said polypeptide 
binds to an antibody that is specific for Sp36 (SEQ ID NO: 7) . 



2. An isolated polypeptide comprising an amino acid sequence with at least 95% 
sequence identity to a sequence selected from the group consisting of SEQ ID 
NO: 2 and 4 wherein said polypeptide is identical to that found in an organism 
selected from the group consisting of Group A streptococci and Staphylococcus 
aureus and wherein said polypeptide binds to an antibody that is specific for 
Sp36 (SEQ ID NO: 7) . 



3. The isolated polypeptide of claim 2 wherein said Group A organism is 
Streptococcus pyogenes . 



4. The isolated polypeptide of claim 2 wherein said organism is Staphylococcus 
aureus . 

5. An isolated polypepbde comprising an amino acid sequence at least 95% 
identical to the sequence of SEQ ID NO: 4 and wherein said polypeptide has a 
sequence with at least 12.6% sequence identity to the amino acid sequence of 
the Sp36 protein (SEQ ID NO: 7) of Streptococcus pneumoniae and wherein said 
isolated polypeptide binds to an antibody that is specific for Sp36. 



6. An isolated polypeptide comprising the sequence of SEQ ID NO: 2 .wherein 
said isolated polypeptide binds to an antibody that is specific for Sp36 (SEQ 
ID NO: 7) of Streptococcus pneumoniae. 



7. An isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 2. 

8. An isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 4. 



http://westbrs:9000Mn/gate.exe?f=^^ 6/20/05 



Record Display Form 



Page 1 of 3 



First Hit 



Previous Doc Next Doc Go to Doc# 



L9: Entry 29 of 34 



File: DWPI 



Feb 10, 2005 



DERWENT-ACC-NO: 2003-120461 
DERWENT-WEEK: 200511 

COPYRIGHT 2005 DERWENT INFORMATION LTD 

TITLE: New BVH-A4 proteins and genes from serotype III Group B streptococcus , 
useful for treating or preventing streptococcal infection in infants, pregnant 
women, non-pregnant adults (e.g. pneumonia ) , or members of dairy herd (mastitis) 

INVENTOR: BOYER, M; BRODEUR, B R ; HAMEL, J ; MARTIN, D ; RIOUX, S 

PATENT-ASSIGNEE: SHIRE BI0CHEM INC (SHIRN) , BOYER M (BOYEI), BRODEUR B R (BRODI), 
HAMEL J (HAMEI), MARTIN D (MARTI), RIOUX S (RIOUI) 

PRIORITY-DATA: 2 001US-2 877 12P (May 2, 2001) 



PATENT- FAMILY: 
PUB-NO 

f] JP 2005503774 W 
□ WO 200288178 A2 

EP 1390505 A2 
fj AU 2002308325 Al 
fj US 20040171113 Al 



PUB-DATE 

February 10, 2005 
November 7, 2002 
February 25, 2004 
November 11, 2002 
September 2, 2004 



LANGUAGE PAGES MAIN-IPC 

095 C12N015/09 

E 059 C07K014/315 

E 000 C12N015/31 

000 C07K014/315 

000 C07H021/04 



DESIGNATED- STATES : AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE 
DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS 
LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN 
TR TT TZ UA UG US UZ VN YU ZA ZM ZW AT BE CH CY DE DK EA ES FI FR GB GH GM GR IE IT 
KE LS LU MC MW MZ NL OA PT SD SE SL SZ TR TZ UG ZM ZW AL AT BE CH CY DE DK ES FI FR 
GB GR IE IT LI LT LU LV MC MK NL PT RO SE SI TR 



A P P L I CAT ION- DATA : 
PUB-NO 



APPL-DATE 



JP2005503774W 


May 


2, 


2002 


JP2005503774W 


May 


2, 


2002 


JP2005503774W 








WO 2 002 8 817 8A2 


May 


2, 


2002 


EP 1390505A2 


May 


2, 


2002 


EP 1390505A2 


May 


2, 


2002 


EP 1390505A2 








AU2002308325A1 


May 


2, 


2002 



APPL-NO 

2002JP-0585476 

2002WO-CA00664 

WO 200288178 

2002WO-CA00664 

2002EP-0766595 

2002WO-CA00664 

WO 200288178 

2002AU-0308325 



DESCRIPTOR 



Based on 



Based on 



http://westbrs:9000ftin/gate.ex^ 6/20/05 



Record Display Form 



Page 2 of 3 



AU2002308325A1 



WO 200288178 



Based on 



US20040171113A1 



May 2, 2002 
April 19, 2004 



2002WO-CA00664 



US20040171113A1 



2004US-0476614 



INT-CL (IPC) : A61 K 38/00; A61 K 39/00; A61 K 39/09; A61 P 31/04; C07 H 21/04; C07 
K 14/ 315 ; C07 K JL9/£0; C12 N 1/15; C12 N 1/19; C12 N 1/21; C12 N 5/K); C12 N 15/09; 
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ABSTRACTED- PUB-NO: WO 200288178A 
BASIC-ABSTRACT: 

NOVELTY - An isolated polypeptide (designated GBS-BVH-A4 ) , which comprises a BVH-A4 
protein polypeptide from serotype III Group B streptococcus (GBS) strain COH1, is 
new. 

DETAILED DESCRIPTION - An isolated polypeptide (designated GBS-BVH-A4) , which 
comprises a BVH-A4 protein polypeptide from serotype III Group B streptococcus 
(GBS) strain COH1 comprises: 

(a) a polypeptide comprising 1055 amino acids (I) fully defined in the 
specification; 

(b) a polypeptide having at least 80-95% identity to a second polypeptide having 



(c) a polypeptide capable of raising antibodies having binding specificity for (I); 

(d) an epitope bearing portion of (a) ; 

(e) fragments or analogs of (a), (b) , (c) or (d) ; 

(f) any of the polypeptides of (a) -(e), where the N-terminal Met residue is 
deleted; or 

(g) any of the polypeptide of (a) -(e), where the secretory amino acid sequence is 
deleted. 

INDEPENDENT CLAIMS are also included for the following: 

(1) An isolated polynucleotide comprising a sequence: 

(a) encoding the GBS-BVH-A4 polypeptide; 

(b) ' having 3168 bp fully defined in the specification; or 

(c) that is complementary of (a) or (b) ; 

(2) Vectors comprising the polynucleotide, where the DNA is operably linked to an 
expression control region; 

(3) Host cells comprising the vector; 

(4) Producing (Ml) the GBS-BVH-A4 polypeptide; 
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(5) Chimeric polypeptides comprising two or more GBS-BVH-A4 polypeptides, or its 
fragments or analogs, provided that the polypeptides are linked so as to form a 
chimeric polypeptide; 

(6) A pharmaceutical composition comprising the GBS-BVH-A4 polypeptide, and a 
pharmaceutical carrier, diluent or adjuvant; 

(7) Diagnosing (M2) GBS bacterial infection in a host susceptible to* GBS infection; 
and 

(8) A kit comprising the GBS-BVH-A4 polypeptide for detecting or diagnosing 
streptococcal infection. 

ACTIVITY - Antibiotic; Immunostimulant . 

Test details are described but no results are given. 

MECHANISM OF ACTION - Polypeptide Therapy; Vaccine. 

Test details are described but no results are given. 

USE - The GBS -BVH -A4 polypeptide is useful for the therapeutic or prophylactic 
treatment of GBS bacterial infection in a host susceptible to GBS infection. In 
particular, the GBS -BVH -A4 polypeptide is useful for treating or preventing GBS 
infection in a neonate or infant (e.g. sepsis, meningitis, pneumonia, cellulitis, 
osteomyelitis, septic arthritis, endocarditis or epiglotitis) , in a pregnant woman 
(e.g. mild urinary tract infection to life-threatening sepsis and meningitis, 
osteomyelitis, endocarditis, amniotis, endometritis, wound infection (post-cesarean 
or post-episiotomy) , cellulitis or fasciitis), in a non-pregnant adult (e.g. 
bacteremia, skin or soft tissue infection, pneumonia, urosepsis, endocarditis, 
peritonitis, meningitis or emphysema), or in a member of dairy herd (e.g. 
mastitis) . The composition or GBS -BVH -A4 polypeptide is also useful for treating or 
preventing streptococcal infection (all claimed) . The GBS -BVH -A4 polypeptide or 
antibody is also useful for diagnosing GBS or streptococcal infection. 

ABSTRACTED- PUB-NO: WO 200288178A 
EQUIVALENT-ABSTRACTS : 

CHOSEN-DRAWING: Dwg.0/2 

DERWENT- CLASS : B04 D16 

CPI-CODES: B04-B04C1; B04-C01G; B04-E01; B04-F0100E; B04-F01E; B04-N0300E; B04- 
N03E; B11-C07A; B11-C08E; B12-K04A4; B14-A01B2; B14-C03; B14-C09; B14-F01; B14-G01; 
B14-K01; B14-N01; B14-N07; B14-N16; B14-S06; B14-S11B; D05-C12; D05-H09; D05-H12A; 
D05-H12B; D05-H12E; D05-H14B; D05-H17A5; D05-H17B5; D05-H17C; 
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DOCUMENT-IDENTIFIER: US 20040052781 Al 

TITLE: Vaccine compositions comprising Streptococcus pneumoniae polypeptides having selected 
structural motifs 



Abstract Paragraph : 

A vaccine composition is disclosed that comprises polypeptides and fragments of polypeptides 
containing histidine triad residues or coiled-coil regions, some of which polypeptides or fragments lie 
between 80 and 680 residues in length. Also disclosed are processes for preventing infection caused by 
S. pneumoniae comprising administering of vaccine compositions. 

Summary of Invention Paragraph : 

[0002] This invention relates generally to the field of bacterial antigens and their use, for example, as 
immunogenic agents in humans and animals to stimulate an immune response. More specifically, it 
relates to the vaccination of mammalian species with a polypeptide comprising at least one conserved 
histidine triad residue (HxxHxH) and at least one helix-forming polypeptide obtained from 
Streptococcus pneumoniae as a mechanism for stimulating production of antibodies that protect the 
vaccine recipient against infection by a wide range of serotypes of pathogenic S. pneumoniae . Further, 
the invention relates to antibodies against such polypeptides useful in diagnosis and passive immune 
therapy with respect to diagnosing and treating such pneumococcal infections. 

Summary of Invention Paragraph : 

[0004] Streptococcus pneumoniae is a gram positive bacteria which is a major causative agent in 
invasive infections in animals and humans, such as sepsis, meningitis, otitis media and lobar pneumonia 
(Tuomanen et al. New Engl. J. Med. 322:1280-1284 (1995)). As part of the infective process, 
pneumococci readily bind to non-inflamed human epithelial cells of the upper and lower respiratory tract 
by binding to eukaryotic carbohydrates in a lectin-like manner (Cundell et al., Micro. Path. 17:361-374 
(1994)). Conversion to invasive pneumococcal infections for bound bacteria may involve the local 
generation of inflammatory factors which may activate the epithelial cells to change the number and 
type of receptors on their surface (Cundell et al., Nature, 377:435-438 (1995)). Apparently, one such 
receptor, platelet activating factor (PAF) is engaged by the pneumococcal bacteria and within a very 
short period of time (minutes) from the appearance of PAF, pneumococci exhibit strongly enhanced 
adherence and invasion of tissue. Certain soluble receptor analogs have been shown to prevent the 
progression of pneumococcal infections (Idanpaan-Heikkila et al., J. Inf. Dis., 176:704-712 (1997)). A 
number of various other proteins have been suggested as being involved in the pathogenicity of S. 
pneumoniae . There remains a need for identifying polypeptides having epitopes in common from 
various strains of S. pneumoniae in order to utilize such polypeptides as vaccines to provide protection 
against a wide variety of S. pneumoniae . 

Summary of Invention Paragraph : 

[0005] In accordance with the present invention, there is provided vaccines and vaccine compositions 
that include polypeptides obtained from S. pneumoniae and/or variants of said polypeptides and/or 
active fragments of such polypeptides. 

Brief Description of Drawings Paragraph : 

[0013] FIG. 3 is a western blot demonstrating the ability of antisera raised against recombinant Sp36 
derived from strain Norway type 4 to react with Sp36 of heterologous strains. Total cell lysates were 
immunoblotted with mouse antisera to Sp36. A band representing Sp36 protein was detected in all 23 S. 
pneumoniae strains tested, which included isolates from each of the 23 pneumococcal serotypes 
represented in the current polysaccharide vaccine. 



http://westbrs:9000Mn/gate.exe?f=DOC 6/20/05 



Record Display Form 



Page 2 of 3 



Brief Description of Drawings Paragraph : 

[0018] FIG. 8 shows the results of immunization of mice with PhtD recombinant protein, which leads to 
protection from lethal sepsis. C3H/HeJ (Panel A and B) or Balb/cByJ (Panel C) mice were immunized 
subcutaneously with PhtD protein (15 .mu.g in 50 .mu.l PBS emulsified in 50 .mu.l complete Freund's 
adjuvant (CFA)). The recombinant PhtD protein used in protection experiments consisted of 819 amino 
acid residues, starting with the cysteine (residue 20). A group of 10 sham-immunized mice received PBS 
with adjuvant. A second immunization of 15 .mu.g protein with incomplete Freund's adjuvant (IF A) was 
administered 3 weeks later; the sham group received PBS with IF A. Blood was drawn (retro-orbital 
bleed) at week 7; and sera from each group was pooled for analysis of anti-PhtD antibody by ELISA. 
Mice were challenged at week 8 by an intraperitonial (i.p.) injection of approximately 550 CFU S. 
pneumoniae strain SJ2, serotype 6B (Panel A), 850 CFU of strain EF6796, serotype 6A (Panel B) or 450 
CFU of strain EF5668, serotype 4 (Panel C). In preliminary experiments, the LD.sub.50 for strain SJ2 
and EF6796 were determined to be approximately 10 CFU for both strains. The LD.sub.50 for strain 
EF5668 was determined to be <5 CFU. Survival was determined in all groups over the course of 15 days 
following challenge. Data are presented as the percent survival for a total of 10 mice per experimental 
group. Two-sample Log-rank test was used for statistical analysis comparing recombinant Pht 
immunized mice to sham-immunized mice. 

Detail Description Paragraph : 

[0033] In accordance with a further aspect of the invention, a vaccine of the type hereinabove described 
is administered for the purpose of preventing or treating infection caused by S. pneumoniae . 

Detail Description Paragraph : 

[0040] In still another aspect the present invention relates to a method of using one or more antibodies 
(monoclonal, polyclonal or sera) to the polypeptides of the invention as described above for the 
prophylaxis and/or treatment of diseases that are caused by pneumococcal bacteria. In particular, the 
invention relates to a method for the prophylaxis and/or treatment of infectious diseases that are caused 
by S. pneumoniae . In a still further preferred aspect, the invention relates to a method for the 
prophylaxis and/or treatment of otitis media, nasopharyngeal, bronchial infections, and the like in 
humans by utilizing a vaccine of the present invention. 

Detail Description Paragraph : 

[0054] The present invention further relates to variants of polynucleotides. The variants of the 
polynucleotides may be a naturally occurring allelic variant of the polynucleotides or a non-naturally 
occurring variant of the polynucleotides. The variants include variants in which one or more bases are 
substituted, deleted or inserted. Complements to such coding polynucleotides may be utilized to isolate 
polynucleotides encoding the same or similar polypeptides. In particular, such procedures are useful to 
obtain native immunogenic portions of polypeptides from different serotypes of S. pneumoniae^ which is 
especially useful in the production of "chain" polypeptide vaccines containing multiple immunogenic 
segments. 

Detail Description Paragraph : 

[0088] The genomic DNA used as target for amplification was isolated from S. pneumoniae Norway 
strain (serotype 4), the same strain used for genomic sequencing. The complete sequence of the Sp36 
gene (SEQ ID NO:9), and its predicted amino acid sequence (SEQ ID NO:8), are given in the Sequence 
Listing appended hereto. It was noted that the predicted amino acid sequence included a hydrophobic 
leader sequence followed by a sequence (LSVC) similar to the consensus sequence for Type II signal 
peptidase (LxxC, in which both x's typically represent small amino acids). Primers (listed as SEQ ID 
NOS:l-3) were designed that would amplify the Sp36 gene and allow its cloning into pQElO and 
expression as a histidine-tagged protein lacking the signal sequence for purification by nickel -affinity 
chromatography. Cloning of the fragment amplified by SEQ ID Nos 1 and 3 would result in a protein 
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containing amino acids 2 through 800 of Sp36; cloning of the fragment amplified by SEQ ID Nos 2 and 

3 would result in a protein containing amino acids 7 through 800 of Sp36 (amino acid numbers refer to 
SEQIDNO:8). 

Detail Description Paragraph : 

[0090] In each of the three experiments shown in FIGS. 1A-1C, C3H/HeJ mice (10/group) were 
immunized intraperitoneally (i.p.) with Sp36 protein (15 .mu.g in 50 .mu.l PBS emulsified in 50 .mu.l 
complete Freund's adjuvant (CFA)). A group of 10 sham-immunized mice received PBS with adjuvant. 
A second immunization of 15 .mu.g protein with incomplete Freund's adjuvant (IF A) was administered 

4 weeks later; the sham group received PBS with IF A. Blood was drawn (retro-orbital bleed) at weeks 3, 
6, and 9; and sera from each group were pooled for analysis of anti-Sp36 antibody by ELISA. Mice were 
challenged at week 10 by an i.p. injection of approximately 500 CFU S. pneumoniae strain SJ2 
(serotype 6B; provided by P. Flynn, St. Jude Children's Research Hospital, Memphis, Tenn.). In 
preliminary experiments, the LD.sub.50 of this strain was determined to be approximately 10 CFU. 
Mice were monitored for 14 days for survival. 

Detail Description Paragraph : 

[0099] C3H/HeJ mice (10 mice/group) were passively immunized by two i.p. injections of 100 .mu.l of 
rabbit serum. The first injection was administered twenty-four hours before challenge with 172 cfu of S. 
pneumoniae strain SJ2, and the second injection was given four hours after challenge. FIG. 2 shows the 
survival of mice after infection with two different strains of pneumococci. 

Detail Description Paragraph : 

[0102] Conservation of Sp36 Among Strains of S. pneumoniae 
Detail Description Paragraph : 

[0105] The mouse anti-Sp36 sera detected two major bands with apparent molecular weights of 97 and 
100 kDa in all 23 pneumococcal lysates tested (shown in FIG. 3). The Sp36 signals obtained from S. 
pneumoniae serotypes 1,5, 17F and 22F were lower, indicating either that the level of Sp36 expression 
is reduced in these strains, or that Sp36 in these strains is antigenically different. 

Detail Description Paragraph : 

[0112] These experiments indicate that Sp36 is recognized by the human immune system and suggest 
that antibodies able to bind the Sp36 protein may be produced during natural S. pneumoniae infection in 
humans. Since the patients were infected with a variety of pneumococcal strains, these data also support 
the idea that Sp36 is antigenically conserved. 

CLAIMS: 

2. A process for preventing infection caused by S. pneumoniae comprising: administering the vaccine of 
claim 1. 
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L9: Entry 10 of 34 



File: PGPB 



Sep 4, 2003 



DOCUMENT- IDENTIFIER : US 20030165528 Al 
TITLE: Streptococcus pyogenes antigens 

Detail Description Paragraph : 

[0110] An ORF which shares 62% with the S. pyogenes BVH-P1 gene was initially 
presented in the patent application PCT/CA99/00114 which' described Group B 
streptococcus antigens. BVH-PL gene was also found to share homology (62% identity) 
with an ORF present in the genome of S. pneumoniae (The Institute for Genomic 
Research) . 
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DOCUMENT-IDENTIFIER: US 6582706 Bl 

TITLE: Vaccine compositions comprising Streptococcus pneumoniae polypeptides having selected 
structural MOTIFS 



Brief Summary Text (2): 

This invention relates generally to the field of bacterial antigens and their use, for example, as 
immunogenic agents in humans and animals to stimulate an immune response. More specifically, it 
relates to the vaccination of mammalian species with a polypeptide comprising at least one conserved 
histidine triad residue (HxxHxH-SEQ ED NO: 12) and at least one helix-forming polypeptide obtained 
from Streptococcus pneumoniae as a mechanism for stimulating production of antibodies that protect the 
vaccine recipient against infection by a wide range of serotypes of pathogenic S. pneumoniae . Further, 
the invention relates to antibodies against such polypeptides useful in diagnosis and passive immune 
therapy with respect to diagnosing and treating such pneumococcal infections. 



http://westbrs:9000^in/cgi-birl/accum_query.pl?MODE=%20%20%20%20Display o ^20 o /o20... 6/20/05 



ExPASy BLAST2 Interface 



Page 1 of 3 



Fi tr 

I? tr 

F\ tr 

f?l tr 

□ tr 
f?i tr 
f?i tr 
Fi tr 

tr 

□ tr 

□ tr 
& tr 
P tr 
15! tr 

□ 
□ 

n 
n 

n 
n 

□ 

n 

□ 
□ 

n 
n 

□ 
□ 

D 
D 



tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
'tr 
tr 
tr 
tr 
tr 
tr 
tr 
sp 
tr 



Q8DQ07 _STRR6 Pneumococcal histidine triad protein E [phtE] [S. 

Q6WNQ7 _STRPN* Surface protein BVH-3 [bvh-3] [Streptococcus pne. 

Q9ANY1 _STRPN Pneumococcal histidine triad protein E precursor. 

Q6WNQ5 _STRPN Surface protein BVH-3 (Fragment) [bvh-3] [Strept. 

Q8CWR4 _STRR6 Histidine Motif-Containing protein [phpA] [Strep. 

Q8DPQ2 _STRR6 Pneumococcal histidine triad protein A [phtA] [S. 

Q9AG7 4 _STRPN PhpA [phpA] [Streptococcus pneumoniae] 

Q9AHT9 _STRPN Pneumococcal histidine triad A protein [phtA] [S. 

Q8DQ08 _STRR6 Pneumococcal histidine triad protein D [phtD] [S. 

Q6T8D7 _STRPN Pneumococcal protein BVH-11-3 [Streptococcus pne. 

Q97QM8 _STRPN Conserved domain protein [SP1175] [Streptococcus. 

Q 9 ANY 2 _STRPN Pneumococcal histidine triad protein D precursor. 

Q97QM9 _STRPN Conserved domain protein [SP1174] [Streptococcus. 

Q 9 ANY 3 _STRPN Pneumococcal histidine triad protein B precursor. 

Q6WNQ3 _STRPN Surface protein BVH-11 (Fragment) [bvh-11] [Stre. 

Q6WNP8 _STRPN Surface protein BVH-11-2 (Fragment) [bvh-11-2] [. 

Q6WNQ1 _STRPN Surface protein BVH-11 (Fragment) [bvh-11] [Stre. 

Q6WNP5 _STRPN Surface protein BVH-11-2 (Fragment) [bvh-11-2] [. 

Q6WNP9 _STRPN Surface protein BVH-11-2 (Fragment) [Streptococc . 

Q6T304 _STRPN Surface protein BVH-11 (Fragment) [bvhll] [Strep. 

Q6WNQ0 _STRPN Surface protein BVH-11 (Fragment) [bvh-11] [Stre. 

Q6WNP6 J3TRPN Surface protein BVH-11-2 (Fragment) [bvh-11-2] [. 

Q8NZ82 _STRP8 Hypothetical protein spyM18_2072 [spyM18_2072] [. 

Q8E4U1 _STRA3 Hypothetical protein gbsl306 [gbs!306] [Streptoc. 

Q8DZ81 _STRA5 Streptococcal histidine triad family protein [SA. 

Q5X9R2 _STRP6 Streptococcal histidine triad protein [M6_Spyl71. 

Q9ZHG7 _STRAG Hypothetical protein [Streptococcus agalactiae] 

Q99XV4 _STRPY Hypothetical protein SPy2006 [SPy2006] [Streptoc. 

Q8K5Q1 _STRP3 Histidine triad protein [SpyM3_1724] [Streptococ. 

Q93GT5 _STRPY Histidine triad protein of group A streptococci . 

Q8E338 _STRA3 Hypothetical protein gbsl925 [gbsl925] [Streptoc. 

Q877Y2 J3TRP3 Hypothetical protein SPsl722 [SPsl722] [Streptoc. 

Q9AE21 _STRAG Hypothetical protein (Fragment) [Streptococcus a. 

Q8DQ06 _STRR6 Pneumococcal histidine triad protein E, truncati. 

Q8E029 __STRA5 Hypothetical protein SAG0907 [SAG0907] [Streptoc. 

Q8E5R2 _STRA3 Hypothetical protein gbs0918 [gbs0918] [Streptoc. 

Q8P0G5 _STRP8 Putative internalin A [ spyM18_1373 ] [Streptococc. 

Q8K714 _STRP3 Putative internalin A [inlA] [Streptococcus pyog. 

Q5XBJ5 _STRP6 Internalin protein [M6_Spyl083] [Streptococcus p. 

Q99Z76 _STRPY Putative internalin A [inlA] [Streptococcus pyog. 

Q5AWR8 _EMENI Hypothetical protein [AN7262.2] [Aspergillus nid. 

Q6HCJ0 _BACHK Cell division protein [ftsK] [Bacillus thuringie. 

P45386 IGA4_HAEIN Immunoglobulin Al protease precursor (EC 3... 

Q8ISF7 _CAEEL 2MDa_l protein [isof] [Caenorhabditis elegans] 
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□ tr Q8ISF6 _CAEEL 2MDa_2 protein [isof] [Caenorhabditis elegans] 46 0. 

□ sp P16053 NFM_CHICK Neurofilament triplet M protein (160 kDa neu. . . 45 0. 

□ sp Q97QP7 IGA1A_STRPN Immunoglobulin Al protease precursor (EC 3... ±± 0. 

□ tr Q869E1 _DICDI DNA ligase I (EC 6.5.1.1) ( Polydeoxyribonucleoti . . . 4.4 0. 

□ tr Q8IB63 _PLAF7 Hypothetical protein PF08_0035 [PF08_0035] [Plas... 44 0. 

□ tr Q9VC00 _DROME CG13648-PA [CG13648] [Drosophila melanogaster (F. . . 44 0. 

□ tr Q5TUJ9 _ANOGA ENSANGP0 00 0002 912 0 [ ENSANGG00000022532 ] [Anophel . . . 44 0, 
fj tr Q839C3 _ENTFA N-acetylmuramoyl-L-alanine amidase, family 4 [EF. . . 4_4 0. 

□ tr Q7PR93 _AN0GA ENSANGP00000010616 [ENSANGG00000008 127 ] [Anophel... 1:1.0. 
I™] tr 073793 _SERCA Neurofilament medium subunit [Serinus canaria (C. . . _43 0. 
O tr Q9FN97 _ARATH Transposon protein-like [Arabidopsis thaliana (M. . . 43 0. 

□ tr Q54U33 _DICDI Hypothetical protein [DDB0218298] [Dictyostelium. . . 43 0. 
O tr Q963T1 _PLARE Glutamate-rich protein (Fragment) [GLURP] [Plasm... 43. 0. 
n tr Q07594 _ENTHI K2 protein (Fragment) [Entamoeba histolytica] : 13 0. 

□ tr Q6PK21 _HUMAN OGFR protein [Homo sapiens (Human)] 13 0. 
O sp_vs Q9NZT2-2 Splice' isoform 2 of Q9NZT2 [OGFR] [Homo sapiens (... 43 0, 
fj sp Q9NZT2 OGFR_HUMAN Opioid growth factor receptor (OGFr) (Zeta-... 43 0. 

□ tr Q6HBX5 _BACHK Hypothetical protein [BT9727_4640] [Bacillus thu... 43 0. 
171 tr Q7RQS8 _PLAYO Retinitis pigmentosa GTPase regulator-like prote... 43_ 0. 

□ tr 077320 _PLAF7 Hypothetical protein MAL3P3.3 [MAL3P3.3] [Plasmo... 43 0. 
O tr Q5HQ7 4 _STAEQ Pyruvate dehydrogenase complex E2 component, dih. . . 12 0, 
O tr Q9L4Z1 _STAEP Pyruvate dehydrogenase complex subunit E2 [pdhC] . . . 42 0. 

□ tr Q9VGW4 _DROME CG14692-PA [CG14692] [Drosophila melanogaster (F. . . 42_ 0. 
H! tr Q90307 _CARAU Carassius auratus [Carassius auratus (Goldfish)] 42_ 0. 
O tr Q898B0 _CLOTE Hypothetical protein CTC00555 [CTC00555] [Clostr. . . ,42 0. 
O tr Q6R4Z8 _CAPBU Dehydrin cor29 [Capsella bursa-pastoris (Shepher. . . 12 0, 

□ tr Q50R78 _ENTHI Hypothetical protein [298.t00012] [Entamoeba his... 42 ( 
^ tr Q8MMQ1 _DICDI Similar to Required for the transfer of mannosyl... 42 ( 
O tr Q8IBL1 _PLAF7 Hypothetical protein MAL7P1 . 129 [MAL7P1.129] [PI... 4£ ( 

□ tr Q55K21 _CRYNE Hypothetical protein [CNBK1650] [Cryptococcus ne... 42_ ( 
^ S P Q8CT13 ODP2_STAEP Dihydrolipoyllysine-residue acetyltransf era . . . 4JL ( 
0 tr 033741 _STRPN SpsA protein [Streptococcus pneumoniae] 41_ C 

□ tr Q8I1W5 _PLAF7 Hypothetical protein PFD0320c (Fragment) [PFD032... 41 ( 

□ tr Q5TVN3 _ANOGA ENSANGP00000027 660 (Fragment) [ENSANGG0000002494 . . . 41 C 
O tr Q07593 _ENTHI K2 protein (Fragment) [Entamoeba histolytica] ±1_ C 

S P Q28820 TRDN_RABIT Triadin [TRDN] [Oryctolagus cuniculus (Rabb... 41 ( 

PI sp Q59947 IGA1_STRR6 Immunoglobulin Al protease precursor (EC 3.... 41_ ( 

□ tr Q7SXW9 _BRARE Wu:fc44al0 protein (Fragment) [wu:fc44al0] [Brae... 41 ( 
G tr Q28688 _RABIT Neurof ilament-H (Fragment) [Oryctolagus cuniculu. . . jLI C 
G tr Q94674 _PLAGA Thrombospondin-related anonymous protein (Fragme... 11 C 
G tr Q5WNG8 _CAEBR Hypothetical protein CBG08011 [CBG08011] [Caenor... 11 ( 
O tr ' Q6BLN0 _DEBHA Similar to ca | CA2433 1 IPF12959 Candida albicans I... 4JL C 

□ sp_vs Q28820-4 Splice isoform Cardiac 3 of Q28820 [TRDN] [Orycto... 41 ( 

□ sp_vs Q28820-6 Splice isoform Skeletal 3 of Q28820 [TRDN] [Oryct... 41, ( 
fj sp Q54875 IGA1B_STRPN Immunoglobulin Al protease precursor (EC 3... 4_0 ( 
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□ tr Q568LQ _BRARE Wu : f c44al0 [wu:fc44al0] [Brachydanio rerio (Zebr. . . . 40 
0 tr Q4ZHU3 _STAXY Biof ilm-associated protein [bap] [Staphylococcus... A0_ 
n tr Q510B7 _ENTHI Hypothetical protein [113.t00023] [Entamoeba his... .40 

□ tr Q8IJ56 _PLAF7 Glutamate-rich protein [PF10_0344] [Plasmodium f . . . 40 
tr Q9GUY4 _PENJP Crustocalcin [DD4(ccn)] [Penaeus japonicus (Kuru. . . 4^ 

□ tr Q9GTX2 _PLAFA Glutamate-rich protein [GLURP] [Plasmodium falci... 40 

□ tr Q61US9 _CAEBR Hypothetical protein CBG05170 [CBG05170] [Caenor. . . 40 
HI tr Q6FWC0 _CANGA Candida glabrata strain CBS138 chromosome D comp. . . 4_0 

tr Q6CTIQ _KLULA Similar to sp|Q05050 Saccharomyces cerevisiae YM. . . 40 

^ tr Q879S6 _XYLFT Hemagglutinin-like secreted protein [pspA] [Xyle. . . _40 

□ tr 017102 _CAEEL Hypothetical protein F42G2 . 6 [F42G2.6] [Caenorha... 40 
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CLUSTAL W (1.74) multiple sequence alignment 



| Q8DQ07 | Q8DQ07_STRR6 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENK-DNN 

| Q6WNQ7 | Q6WNQ7_STRPN MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENK-DNN 

| Q 3 ANY I | Q9ANY1_STRPN MK F S KK Y I AAG S AV I V S L S L C A Y ALNQ H R S Q E N K - DKN 

| Q6WNQ5 | Q6WNQ5_STRPN CAYALNQHRSQENK-DNN 

| Q8DPQ2 | Q8DPQ2_STRR6 MQLEISNRKRVSMKINKKYLVG-SAAALILSVCSYELGLYQARTVK-ENN 

| Q9AG74 | Q9AG74_STRPN MKINKKYLVG-SAAALILSVCSYELGLYQARTVK-ENN 

| Q9AHT9 | Q9AHT9_STRPN MKINKKYLVG-SAAALILSVCSYELGLYQARTVK-ENN 

| Q8DQ08 | Q8DQ08_STRR6 MKINKKYLAG-SVAVLALSVCSYELGRHQAGQVKKESN 

| Q 9 ANY 2 | Q9ANY2_STRPN MKI NKKYLAG - S VAVLALS VCS YE LGRHQAGQVKKE SN 

| Q9ANY3 | Q9ANY3_STRPN MKI NKKY LAG- S VAVLALS VCS YE LGRYQAGQDKKESN 

★ . * ★ ... ★ . ★ 



IQ8DQ07 
IQ6WNQ7 
| Q9ANY1 
IQ6WNQ5 
| Q8DPQ2 
IQ9AG74 
IQ9AHT9 
IQ8DQ08 
| Q 9 ANY 2 
IQ9ANY3 



IQ8DQ07 
| Q6WNQ7 
IQ9ANY1 
| Q6WNQ5 
IQ8DPQ2 
IQ9AG74 
IQ9AHT9 
IQ8DQ08 
IQ9ANY2 
| Q 9 ANY 3 



|Q8DQ07_ 
| Q6WNQ7] 
| Q9ANY1~ 
| Q6WNQ5_ 
IQ8DPQ2" 
|Q9AG74~ 
IQ9AHT9] 
|Q8DQ08_ 
| Q9ANY2_ 
| Q9ANY3 



|Q8DQ07_ 
IQ6WNQ7" 
| Q9ANY1~ 
| Q6WNQ5" 
|Q8DPQ2~ 
|Q9AG74~ 
|Q9AHT9~ 
| Q8DQ08_ 
| Q9ANY2~ 
| Q 9 ANY 3~ 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



RVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHY 
RVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHY 
RVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHY 
RVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHY 
RVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 
RVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 
RVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 
RVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 
RVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 
RVAYIDGDQAGQKMNLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHY 

HYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVY 
HYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVY 
HYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVY 
HYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVY 
HYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVY 
HYYNGKVPYDAIISEELLMKDPNYQLKDEDIISEIKGGYVIKVDGKYYVY 
HYYNGKVPYDAIISEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVY 
HY YNGKVPYDAI I SEELLMKDPNYQLKDSDI VNE I KGGYVIKVDGKY YVY 
HYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVY 
HY YNGKVPYDAI I SEELLMKDPNYQLKDSDI VNE I KGGYVIKVNGKYYVY 



I Q8DQ07 I Q8DQ07_STRR6 LKDA7\HADNVRTKDE INRQKQEHVKDNE KVNSNVAVT^RSQGRYTTN 

I Q6WNQ7 I Q6WNQ7_STRPN LKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARSQGRYTTN 

I Q9ANY1 1 Q9ANY1_STRPN LKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARSQGRYTTN 

I Q6WNQ5 I Q6WNQ5_STRPN LKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARSQGRYTTN 

I Q8DPQ2 I Q8DPQ2_STRR6 LKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTD 
I Q9AG74 I Q9AG7 4_STRPN ' LKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTD 

I Q9AHT9 I Q 9AHT 9_STR PN LKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTD 

I Q8DQ08 I Q8DQ08_STRR6 LKDAAHADNIRTKEEIKRQKQERSHNHN S RADN AVAAARAQ GR Y T T D 

I Q 9 ANY 2 I Q9ANY2_STRPN LKDAAHADNIRTKEEIKRQKQEHSHNHGG — GSNDQAWAARAQGRYTTD 

I Q 9 ANY 3 I Q 9 ANY 3 STRPN LKDAAHADNIRTKEEIKRQKQERSHNHN S RADNAVAAARAQ GR Y T T D 



★★★★★★★★★ . • *★ • ★★★★★ . 



IQ8DQ07 
IQ6WNQ7 
IQ9ANY1 
IQ6WNQ5 
I Q8DPQ2 
IQ9AG7 4 
IQ9AHT9 
IQ8DQ08 
I Q9ANY2 



|Q8DQ07_ 
I Q6WNQ7_ 
I Q9ANYl" 
I Q6WNQ5~ 
IQ8DPQ2] 
IQ9AG7 4" 
|Q9AHT9_ 
IQ8DQ08] 
I Q 9 ANY 2" 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 



DGYVFNPADI 
DGYVFNPADI 
DGYVFNPADI 
DGYVFNPADI 
DGYIFNASDI 
DGYIFNASDI 
DGYIFNASDI 
DGYIFNASDI 
DGYIFNASDI 



IEDTGNAYIVPHGGHYHYI 
IEDTGNAYIVPHGGHYHYI 
IEDTGNAYIVPHGGHYHYI 
IEDTGNAYIVPHRGHYHYI 
IEDTGDAYIVPHGDHYHYI 
I E DT GDAY I VPHGDH Y H Y I 
IEDTGDAYIVPHGDHYHYI 
IEDTGDAYIVPHGDHYHYI 
IEDTGDAYIVPHGDHYHYI 



PKS DLS AS E LAAAKAH LAGK - 
PKSDLSASELAAAKAHLAGK- 
PKS DLSASE LAAAKAH LAGK- 
PKS DLS AS E LAAAKAH LAGK - 
PKNE LS AS E LAAAE AFLS GRG 
PKNE LS AS E LAAAKAFLS GRG 
PKNE LS AS E LAAAE AFLS GRG 
PKS DLS AS E LAAAQAYWNGK - 
PKNE L S AS E LAAAE AYWNGK- 
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DGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGK- 
***.**_.*★★**** : ****** 9 ******* ^ -***★★****.* t *• 

NMQP-SQLSYSSTASD NNTQSVAKGSTSKPANKSEN 

NMQP-SQLSYSSTASD NNTQSVAKGSTSKPANKSEN 

NMQP-SQLSYSSTASD NNTQSVAKGSTSKPANKSEN 

NMQP-SQLSYSSTASD NNTQSVAKGSTSKPANKSEN 

NLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSND 
NLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSND 
NLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSND 

QGSRPSSSSSHNANPAQPRLSENHNLTVTPTYHQN-QGEN 

QGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQN-QGEN 

QGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQN-QGEN 

* r . . : : . : . : . * : . : : 

LQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIP 
LQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIP 
LQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIP 
LQS LLKE LYDS PSAQRY SESDGLVFD PAKI I SRT PNGVAI PHGDHYHFIP 
IDSLLKQLYKLPLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIP 
IDSLLKQLYKLPLSQRHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIP 
IDSLLKQLYKLPLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIP 
ISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIP 
ISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIP 
ISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIP 
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SAETAAGIDSKLAKQESLSHK LGTKKTD LPSSDR 

: : . * . : . ** : : : : : : 

NHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKD 
NHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKD 
NHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKD 
NHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKD 



EFYDKAYNLLTEAHKALFEN-KGRNSDFQALDKLLERLNDEST N 

EFYNKAYDLLARIHQDLLDN-KGRQVDFEALDNLLERLKDVSS D 

EFYDKAYNLLTEAHKALFXN-KGRNSDFQALDKLLERLNDEST N 

EFYNKAYDLLARIHQDLLDN-KGRQVDFEALDNLLERLKDVSS D 

EFYNKAYDLLARIHQDLLDN-KGRQVDFEALDNLLERLKDVPS D 

E FYNKAYDLLARI HQDLLDN-KGRQVDFEALDNLLERLKDVSS D 

:.* : * . : :..•*** • • * 



LDKKIEEKIAGIMKQYGVKRES I WNKEKNAI I Y PHGDHHHADPI DEHKP 
LDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPIDEHKP 
LDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPHGDHHHADPIDEHKP 
LDKKIEEKIAGIMKQYGVKRE SI WNKEKNAI I Y PHGDHHHADPI DEHKP 



KEKLVDDLLAFLAPITHPER LGKPNSQIEYTE DEVRIAQL 

KVKLVDDILAFLAPIRHPER LGKPNAQITYTD DEIQVAKL 

KEKLVDDLLAFLAPITHPER LGKPNSQIEYTE DEVRIAQL 

KVKLVDDILAFLAPIRHPER LGKPNAQITYTD DEIQVAKL 

KVKLVDDILAFLAPIRHPER LGKPNAQITYTD DEIQVAKL 

KVKLVEDILAFLAPIRHPER LGKPNAQITYTD DEIQVAKL 

; * * * . . * : : 



VGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNF 
VGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNF 
VGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNF 
VGI GHS HSNYE LFKPEE GVAKKE GNKVYTGEE LTNWNLLKNST FNNQNF 
ADK-YTTSDGYIFDEHD-IISDEGD-AYVTPHMGHS-HWIGKDSLSDKEK 
AGK-YTTEDGYIFDPRD-ITSDEGD-AYVTPHMTHS-HWIKKDSLSEAER 
ADK-YTTSDGYIFDEHD-IISDEGD-AYVTPHMGHS-HWIGKDSLSDKEK 
AGK-YTTEDGYIFDPRD-ITSDEGD-AYVTPHMTHS-HWIKKDSLSEAER 
AGK-YTTEDGYIFDPRD-ITSDEGD-AYVTPHMTHS-HWIKKDSLSEAER 

AGK-YTAEDGYIFDPRD-ITSDEGD-AYVTPHMTHS-HWIKKDSLSEAER 
. * * * . * 



TLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVG 
TLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVG 
TLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVG 
TLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVG 

VAAQAYTKEKGI LPPS P DAD V KAN PTGDSAAAI YNRVKGE 

AAAQAYAKEKGLTPPSTDHQDSGN TEAKGAEAI YNRVKAA 

VAAQ AY T KE KGI L P P S P DAD V KAN PT GDSAAAI YNRVKGE 

AAAQAYAKEKGLTPPSTDHQDSGN TEAKGAEAI YNRVKAA 

AAAQAYAKEKGLTPPSTDHQDSGN TEAKGAEAI YNRVKAA 

AAAQAYAXEKGLTPPSTDHQDSGN TEAKGAEAI YNRVKAA 

* : . ** : * ... : . : * 



NIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTI 
NIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTI 
NIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTI 
NIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTI 

— KRIPLVR — LP-YMVEHTVEVK NGNLIIP HKDHYHNIK 

— KKVPLDR — MP-YNLQYTVEVK NGSLIIP HYDHYHNIK 

— KRIPLVR — LP-YMVEHTVEVK NGNLIIP HKDHYHNIK 

— KKVPLDR — MP-YNLQYTVEVK NGSLIIP HYDHYHNIK 

— KKVPLDR — MP-YNLQYTVEVK = NGSLIIP HYDHYHNIK 



http://www. 
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FYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKL 
FY PFHAGDTYLRVNPQFAVPKGTDALVRVFDE FHGNAYLENNYKVGE IKL 
FYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFKGNAYLENNYKVGEIKL 
FY PFHAGDTYLRVNPQFAVPKGTDALVRVFDE FHGNAYLENNYKVGE IKL 

FAWFDDH TYKAPNG-YTLEDLFATIK— YYVEHPDER 

FEWFDEG LY EAPKG- Y S LE DLLATVK — Y YVE HPNER 

FAWFDDH TYKAPNG-YTLEDLFATIK — YYVEHPDER 

FEWFDEG LY EAPKG- YSLE DLLATVK — YYVEHPNER 

FEWFDEG LYEAPKG-YTLEDLLATVK — YYVEHPNER 

FEWFDEG LYEAPKG-YTLEDLLATVK — YYVEHPNER 



PI PKLNQGTTRTAGNKI 
PI PKLNQGTTRTAGNKI 
PI PKLNQGTTRTAGNKI 
PI PKLNQGTTRTAGNKI 

— PHSNDG WGN — 

— PHSDNG FGN — 

— PHSNDG WGN — 

— PHSDNG FGN — 

— PHSDNG FGN — 

— PHSDNG FGN — 



P VT FMANAY LDNQ STYIVEVP ILEKENQT 

PVT FMANAY LDNQST Y I VE VP ILEKENQT 

PVTFMANAYLDNQSTYIVEVP ILEKENQT 

PVT FMANAY LDNQST Y I VE VP ILEKENQT 

ASEHVLGKK DHS 

ASDHVQRNKNGQ ADTNQT 

ASEHVLGKK DHS 

ASDHVQRNKNGQADTNQTEKPNEEKPQT 

ASDHVRKNK VDQD 

ASDHVQRNKNGQ ADTNQT 



DKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 
DKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 
DKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 
DKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 

EDP NKNFKADEE PVE — ETP-AEP 

EKP NEEKPQTEKPEE — ETPREEKP-QSEKPESP 

EDP NKNFKADEE PVE - -ET P-AE P 

EKP EEDKEHDEVSEP — THPESDEK-ENHVGLNPS-ADN 

SKP DEDKEHDEVSEP — THPESDEK-ENHAGLNPS-ADN 

EKP SEEKPQTEKPEE — ETPREEKP-QSEKPESP 



EEVPTVDPVQEKVAKFAESYG-MKLENVLFNMDGTIELYLPSGEVIKKNM 
EEVPTVDPVQEKVAKFAESYG-MKLENVLFNMDGTIELYLPSGEVIKKNM 
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E VPQVET EKVEAQLKEAEVLLAKV 

— KPTEEPEEESPEESPE ESEEPQVET EKVKEKLREAEDLLGKI 

E VPQVET EKVEAQLKEAEVLLAKV 

LYKPSTDTEE-TEEEAEDTTDEAEIPQVEH SVINAKIAEAEALLEKV 

LYKPSTDTEE-TEEEAEDTTDEAEI PQVEN SVINAKIADAEALLEKV 

— KPTEEPEE-SPEESEE PQVET EKVEEKLREAEDLLGKI 



ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 
ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 
ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 
ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 
TDSS — LKANATETLAGLRNNLTLQIMDNN-SIMMAEKLLALLKGS — N 
QNPI — IKSNAKETLTGLKNNLLFGTQDNN-TIMAEAEKLLALLKES — K 
TDSS — LKANATETLAGLRNNLTLQIMDNN-SIMAEAEKLLALLKGS — N 
TDSS — IRQNAVETLTGLKSSLLLGTKDNN-TISAEVDSLLALLKES — Q 
TDPS — IRQNAMETLTGLKSSLLLGTKDNN-TISAEVDSLLALLKES — Q 
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PENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGL 
P E N S T DN GMLN PE GNV GS D PMLD PALE E APAVD PVQE KLE K FTAS Y GLG L 
PENS TDNGMLNPE GNVGS DPMLD PALE EAPAVD PVQE KLE KFTASY GLGL 
PSSVSKEKIN 

PSSVSKEKIN 

PTPIQ 

PAPIQ 



DSVI FNMDGTIELRLPSGEVIKKNLSDLIA 
DSVI FNMDGTIELRLPSGEVIKKNLSDLIA 
DSVI FNMDGTIELRLPSGEVIKKNLSDLIA 
DSVI FNMDGTIELRLPSGEVIKKNLSDLIA 



http://www.ch.embnet.org/wwwtmp/sq.12513.aln 



6/20/05 



Page 1 of 5 



PileUp 



MSF 


1080 Type: P 


Check: 


8540 


Name : 


tr IQ8DQ07 IQ8DQ07 


_STRR6 


oo 


Len: 


Name : 


tr | Q6WNQ7 | Q6WNQ7 


JSTRPN 


oo 


Len : 


Name : 


tr | Q9ANY1 1 Q9ANYl" 


_STRPN 


oo 


Len : 


Name : 


tr|Q6WNQ5|Q6WNQ5~ 


_STRPN 


oo 


Len : 


Name : 


tr IQ8DPQ2 IQ8DPQ2" 


_STRR6 


oo 


Len : 


Name : 


tr|Q9AG74|Q9AG74 


_STRPN 


oo 


Len : 


Name : 


tr | Q9AHT9 | Q9AHT9 


JSTRPN 


oo 


Len : 


Name : 


tr IQ8DQ08 IQ8DQ08 


_STRR6 


oo 


Len : 


Name : 


tr | Q9ANY2 | Q9ANY2* 


_STRPN 


oo 


Len : 


Name : 


tr | Q9ANY3 | Q9ANY3 


_STRPN 


oo 


Len : 



// 



1080 


Check: 


6992 


Weight : 


0. 


100 


1080 


Check: 


6992 


Weight : 


0. 


100 


1080 


Check: 


7347 


Weight : 


0. 


100 


1080 


Check : 


4063 


Weight : 


0. 


100 


1080 


Check: 


1836 


Weight : 


0. 


100 


1080 


Check: 


8409 


Weight : 


0. 


100 


1080 


Check : 


9461 


Weight : 


0. 


100 


1080 


Check: 


9939 


Weight : 


0. 


100 


1080 


Check: 


1016 


Weight : 


0. 


100 


1080 


Check: 


2485 


Weight : 


0. 


100 



Q8DQ07 
Q6WNQ7 
Q9ANY1 
Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 9 ANY 2 
Q 9 ANY 3 



Q8DQ07 
Q6WNQ7 
Q9ANY1 
Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 9 ANY 2 
Q 9 ANY 3 



Q8DQ07 
Q6WNQ7 
Q9ANY1 
Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 9 ANY 2 
Q 9 ANY 3 



|Q8DQ07_ 
| Q6WNQ7~ 
| Q9ANY1~ 
[ Q6WNQ5~ 
IQ8DPQ2] 
IQ9AG74" 
|Q9AHT9_ 
IQ8DQ08] 
| Q9ANY2~ 
| Q 9 ANY 3~ 



|Q8DQ07_ 
IQ6WNQ7] 
| Q9ANY1~ 
| Q6WNQ5~ 
|Q8DPQ2~ 
IQ9AG7 4] 
|Q9AHT9~ 
|Q8DQ08~ 
I Q9ANY2~ 
I Q 9 ANY 3 



|Q8DQ07_ 
I Q6WNQ7~ 
|Q9ANY1_ 
I Q6WNQ5_ 
IQ8DPQ2] 
IQ9AG7 4_ 
I Q9AHT9_ 
|Q8DQ08_ 
I Q9ANY2~ 
I Q 9 ANY 3" 



STRR6 
"STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
*STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



MKFSKKYI AAGSAVIVSL SLCAYALNQH 

MKFSKKYI AAGSAVIVSL SLCAYALNQH 

MKFSKKYI AAGSAVIVSL SLCAYALNQH 

'.. . . CAYALNQH 

MQLEISNRKR VSMKINKKYL VG.SAAALIL SVCSYELGLY 

MKINKKYL VG.SAAALIL SVCSYELGLY 

MKINKKYL VG.SAAALIL SVCSYELGLY 

MKINKKYL AG . S VAVLAL SVCSYELGRH 

MKINKKYL AG . S VAVLAL SVCSYELGRH 

MKINKKYL AG . S VAVLAL SVCSYELGRY 



RVSYVDGSQS 
RVSYVDGSQS 
RVSYVDGSQS 
RVSYVDGSQS 
RVSYIDGKQA 
RVSYIDGKQA 
RVSYIDGKQA 
RVSYIDGDQA 
RVSYIDGDQA 
RVAYIDGDQA 



HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 
HYYNGKVPYD 



SQKSENLTPD 
SQKSENLTPD 
SQKSENLTPD 
SQKSENLTPD 
TQKTENLTPD 
TQKTENLTPD 
TQKTENLTPD 
GQKAENLTPD 
GQKAENLTPD 
GQKAENLTPD 



ALFSEELLMK 
ALFSEELLMK 
ALFSEELLMK 
ALFSEELLMK 
AIFSEELLMK 
AIISEELLMK 
AIISEELLMK 
AIISEELLMK 
AIISEELLMK 
AIISEELLMK 



QVS QKEGIQA 
QVSQKEGIQA 
QVS QKEGIQA 
QVSQKEGIQA 
EVSKREGINA 
EVSKREGINA 
EVSKREGINA 
EVSKREGINA 
EVSKREGINA 
EVSKREGINA 



DPNYQLKDAD 
DPNYQLKDAD 
DPNYQLKDAD 
DPNYQLKDAD 
DPNYKLKDED 
DPNYQLKDED 
DPNYKLKDED 
DPNYQLKDSD 
DPNYQLKDSD 
DPNYQLKDSD 



EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 
EQIVIKITDQ 



IVNEVKGGYI 
IVNEVKGGYI 
IVNEVKGGYI 
IVNEVKGGYI 
IVNEVKGGYV 
IISEIKGGYV 
IVNEVKGGYV 
IVNEIKGGYV 
IVNEIKGGYV 
IVNEIKGGYV 



RSQENK.DNN 
RSQENK.DNN 
RSQENK.DNN 
RSQENK.DNN 
QARTVK.ENN 
QARTVK . ENN 
QARTVK . ENN 
QAGQVKKESN 
QAGQVKKESN 
QAGQDKKESN 



GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 
GYVTSHGDHY 



IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVDGKYYVY 
IKVNGKYYVY 



Q8DQ07 |Q8DQ07_STRR6 
Q6WNQ7 I Q6WNQ7_STRPN 
Q9ANY1 IQ9ANY1 STRPN 



LKDAAHADNV RTKDEINRQK QEHVKDNE . 
LKDAAHADNV RTKDEINRQK QEHVKDNE. 
LKDAAHADNV RTKDEINRQK QEHVKDNE. 



.KVNSNVAV ARSQGRYTTN 
.KVNSNVAV ARSQGRYTTN 
.KVNSNVAV ARSQGRYTTN 
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tr 


Q6WNQ5 


Q6WNQ5_ 


_STRPN 


LKDAAHADNV 


tr 


Q8DPQ2 


Q8DPQ2_ 


_STRR6 


LKDAAHADNV 


tr 


Q9AG74 


Q9AG74~ 


_STRPN 


LKDAAHADNV 


tr 


Q9AHT9 


Q9AHT9_ 


_STRPN 


LKDAAHADNV 


tr 


Q8DQ08 


Q8DQ08 


_STRR6 


LKDAAHADNI 


tr 


Q 9 ANY 2 


Q9ANY2~ 


_STRPN 


LKDAAHADNI 


tr 


Q 9 ANY 3 


Q9ANY3~ 


_STRPN 


LKDAAHADNI 


tr 


Q8DQ07 


Q8DQ07 


_STRR6 


DGYVFNPADI 


tr 


Q6WNQ7 


Q6WNQ7~ 


JSTRPN 


DGYVFNPADI 


tr 


Q9ANY1 


Q9ANY1_ 


_STRPN 


DGYVFNPADI 


tr 


Q6WNQ5 


Q6WNQ5_ 


_STRPN 


DGYVFNPADI 


tr 


Q8DPQ2 


Q8DPQ2_ 


_STRR6 


DGYIFNASDI 


tr 


Q9AG74 


Q9AG74~ 


_STRPN 


DGYIFNASDI 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


DGYIFNASDI 


tr 


Q8DQ08 


Q8DQ08_ 


_STRR6 


DGYIFNASDI 


tr 


Q 9 ANY 2 


Q9ANY2~ 


_STRPN 


DGYIFNASDI 


tr 


Q 9 ANY 3 


Q9ANY3~ 


_STRPN 


DGYIFNASDI 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 




tr 


Q6WNQ7 


Q6WNQ7_ 


_STRPN 




tr 


Q9ANY1 


Q9ANY l" 


_STRPN 




tr 


Q6WNQ5 


Q6WNQ5_ 


_STRPN 




tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


NLSNSRTYRR 


tr 


Q9AG74 


Q9AG74~ 


_STRPN 


NLSNSRTYRR 


tr 


Q9AHT9 


Q9AHT9_ 


_STRPN 


NLSNSRTYRR 


tr 


Q8DQ08 


Q8DQ08~ 


_STRR6 




tr 


Q 9 ANY 2 


Q9ANY2~ 


_STRPN 




tr 


Q 9 ANY 3 


Q9ANY3~ 


_STRPN 




tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


LQSLLKELYD 


tr 


Q6WNQ7 


Q6WNQ7" 


_STRPN 


LQSLLKELYD 


tr 


Q9ANY1 


Q9ANY1_ 


_STRPN 


LQSLLKELYD 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


LQSLLKELYD 


tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


IDSLLKQLYK 


tr 


Q9AG74 


Q9AG74_ 


JSTRPN 


IDSLLKQLYK 


tr 


Q9AHT9 


Q9AHT9_ 


_STRPN 


IDSLLKQLYK 


tr 


Q8DQ08 


Q8DQ08~ 


_STRR6 


ISSLLRELYA 


tr 


Q9ANY2 


Q9ANY2~ 


_STRPN 


ISSLLRELYA 


tr 


Q 9 ANY 3 


Q9ANY3_ 


_STRPN 


ISSLLRELYA 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


YSKLSALEEK 


tr 


Q6WNQ7 


Q6WNQ7^ 


_STRPN 


YSKLSALEEK 


tr 


Q9ANY1 


Q9ANY1~ 


_STRPN 


YSKLSALEEK 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


YSKLSALEEK 


tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


YSQMSELEER 


tr 


Q9AG7 4 


Q9AG7 4] 


_STRPN 


YSQLSPLEEK 


tr 


Q9AHT9 


Q9AHT9] 


~STRPN 


YSQMSELEER 


tr 


Q8DQ08 


Q8DQ08" 


"STRR6 


YSQLSPLEEK 


tr 


Q9ANY2 


Q9ANY2~ 


_STRPN 


YEQMSELEKR 


tr 


Q 9 ANY 3 


Q9ANY3~ 


_STRPN 


YEQMSELEKR 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


TTSKELSSAS 


tr 


Q6WNQ7 


Q6WNQ7~ 


_STRPN 


TTSKELSSAS 


tr 


Q9ANY1 


Q9ANYl" 


~STRPN 


TTSKELSSAS 



RTKDEINRQK QEHVKDNE . . . . KVNSNVAV ARSQGRYTTN 
RTKEEINRQK QEHSQHREGG TPRNDGAVAL ARSQGRYTTD 
RTKEEINRQK QEHSQHREGG TPRNDGAVAL ARSQGRYTTD 
RTKEEINRQK QEHSQHREGG TPRNDGAVAL ARSQGRYTTD 
RTKEEIKRQK QERSHNHN. . . SRADNAVAA ARAQGRYTTD 
RTKEEIKRQK QEHSHNHGG. . GSNDQAWA ARAQGRYTTD 
RTKEEIKRQK QERSHNHN.. .SRADNAVAA ARAQGRYTTD 



IEDTGNAYIV PHGGHYHYIP KSDLSASELA AAKAHLAGK. 

IEDTGNAYIV PHGGHYHYIP KSDLSASELA AAKAHLAGK. 

IEDTGNAYIV PHGGHYHYIP KSDLSASELA AAKAHLAGK. 

IEDTGNAYIV PHRGHYHYIP KSDLSASELA AAKAHLAGK. 

IEDTGDAYIV PHGDHYHYIP KNELSASELA AAEAFLSGRG 

IEDTGDAYIV PHGDHYHYIP KNELSASELA AAKAFLSGRG 

IEDTGDAYIV PHGDHYHYIP KNELSASELA AAEAFLSGRG 

IEDTGDAYIV PHGDHYHYIP KSDLSASELA AAQAYWNGK. 

IEDTGDAYIV PHGDHYHYIP KNELSASELA AAEAYWNGK . 

IEDTGDAYIV PHGDHYHYIP KNELSASELA AAEAYWNGK. 



.NMQP.SQLS YSSTASD... NNTQSVAKGS TSKPANKSEN 
.NMQP.SQLS YSSTASD... NNTQSVAKGS TSKPANKSEN 
.NMQP.SQLS YSSTASD... NNTQSVAKGS TSKPANKSEN 
.NMQP.SQLS YSSTASD... NNTQSVAKGS TSKPANKSEN 
QNSDNTSRTN WVPSVSNPGT TNTNTSNNSN TNSQASQSND 
QNSDNTSRTN WVPSVSNPGT TNTNTSNNSN TNSQASQSND 
QNSDNTSRTN WVPSVSNPGT TNTNTSNNSN TNSQASQSND 
QGSRPSSSSS HNANPAQPRL SENHNLTVTP TYHQN . QGEN 
QGSRPSSSSS YNANPAQPRL SENHNLTVTP TYHQN. QGEN 
QGSRPSSSSS YNANPAQPRL SENHNLTVTP TYHQN. QGEN 



SPSAQRYSES DGLVFDPAKI ISRTPNGVAI PHGDHYHFIP 

SPSAQRYSES DGLVFDPAKI ISRTPNGVAI PHGDHYHFIP 

SPSAQRYSES DGLVFDPAKI ISRTPNGVAI PHGDHYHFIP 

SPSAQRYSES DGLVFDPAKI ISRTPNGVAI PHGDHYHFIP 

LPLSQRHVES DGLVFDPAQI TSRTARGVAV PHGDHYHFIP 

LPLSQRHVES DGLIFDPAQI TSRTANGVAV PHGDHYHFIP 

LPLSQRHVES DGLVFDPAQI TSRTARGVAV PHGDHYHFIP 

KPLSERHVES DGLIFDPAQI TSRTANGVAV PHGDHYHFIP 

KPLSERHVES DGLIFDPAQI TSRTARGVAV PHGNHYHFIP 

KPLSERHVES DGLIFDPAQI TSRTARGVAV PHGNHYHFIP 



IARRVPISGT GSTVSTNAKP NEWSSLGSL SSNPS...SL 

IARRVPISGT GSTVSTNAKP NEWSSLGSL SSNPS...SL 

IARMVPISGT GSTVSTNAKP NEWSSLGSL SSNPS...SL 

IARMVPISGT GSTVSTNAKP NEWSSLGSL SSNPS...SL 

IARIIPLRYR SNHWVPDSRP .EQP§PQPTP EPSPG 

LARIIPLRYR SNHWVPDSRP .EQPSPQSTP EPSPSPQPAP 

IARIIPLRYR SNHWVPDSRP .EQPSPQPTP EPSPG 

LARIIPLRYR SNHWVPDSRP .EQPSPQSTP EPSPSPQPAP 

IARIIPLRYR SNHWVPDSRP .EQPSPQSTP EPSPSPQPAP 

IARIIPLRYR SNHWVPDSRP .EEPSPQPTP EPSPS 



DGYIFNPKDI VEETATAYIV RHGDHFHYIP KSNQIGQPTL 
DGYIFNPKDI VEETATAYIV RHGDHFHYIP KSNQIGQPTL 
DGYIFNPKDI VEETATAYIV RHGDHFHYIP KSNQIGQPTL 
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tr 
tr 
tr 
tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 
tr 



tr 
tr 
tr 



Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 3 ANY 2 
Q 9 ANY 3 



Q8DQ07 
Q6WNQ7 
Q9ANY1 
Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 9 ANY 2 
Q 9 ANY 3 



I Q6WNQ5_ 
I Q8DPQ2~ 
|Q9AG74~ 
|Q9AHT9~ 
IQ8DQ08" 
i Q9ANY2_ 
I Q 9 ANY 3 _ 



I Q8DQ07_ 
I Q6WNQ7_ 
I Q9ANY1_ 
I Q6WNQ5_ 
|Q8DPQ2_ 
|Q9AG74_ 
I Q9AHT9_ 
I Q8DQ08_ 
I Q9ANY2~ 
I Q 9 ANY 3" 



Q8DQ07 
Q6WNQ7 
Q9ANY1 
Q6WNQ5 
Q8DPQ2 
Q9AG74 
Q9AHT9 
Q8DQ08 
Q 9 ANY 2 
Q 9 ANY 3 



|Q8DQ07_ 
I Q6WNQ7~ 
I Q9ANYl] 
I Q6WNQ5] 
I Q8DPQ2~ 
|Q9AG74~ 
|Q9AHT9~ 
IQ8DQ08" 
I Q9ANY2~ 
I Q9ANY3 



Q8DQ07I 
Q6WNQ7 I 
Q9ANY1 I 
Q6WNQ5 I 
Q8DPQ2 I 
Q9AG7 4 I 
Q9AHT9 I 
Q8DQ08 J 
Q 9 ANY 2 I 
Q 9 ANY 3 I 



Q8DQ07 I 
Q6WNQ7 I 
Q9ANY1 I 
Q6WNQ5 I 
Q8DPQ2 I 
Q9AG74 I 
Q9AHT9 I 
Q8DQ08 I 
Q 9 ANY 2 I 
Q 9 ANY 3 I 



Q8DQ07_ 
Q6WNQ7~ 
Q9ANYl[ 
Q6WNQ5] 
Q8DPQ2] 
Q9AG74~ 
Q9AHT9~ 
Q8DQ08~ 
Q 9 ANY 2~ 
Q 9 ANY 3~ 



Q8DQ07_ 
Q6WNQ7] 
Q9ANY1_ 
Q6WNQ5~ 
Q8DPQ2_ 
Q9AG74~ 
Q9AHT9~ 
Q8DQ08~ 
Q 9 ANY 2~ 
Q 9 ANY 3" 



STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



STRR6 
STRPN 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 
STRR6 
STRPN 
STRPN 



TTSKELSSAS 
. PQPAPNLKI 
NPQPAPSNPI 
. PQPAPNLKI 
NPQPAPSNPI 
NPQPAPSNPI 
. PQPAPSNPI 



PNNSLATPSP 
PNNSLATPSP 
PNNSLATPSP 
PNNSLATPSP 
PSETVKNLES 
SAETAAGIDS 
PSETVKNLES 
SAETAAGIDS 
SAETAAGIDS 
SAETAAGIDS 



NHYFFKKDLT 
NHYFFKKDLT 
NHYFFKKDLT 
NHYFFKKDLT 
EFYDKAYNLL 
EFYNKAYDLL 
EFYDKAYNLL 
EFYNKAYDLL 
EFYNKAYDLL 
EFYNKAYDLL 



LDKKIEEKIA 
LDKKIEEKIA 
LDKKIEEKIA 
LDKKIEEKIA 
KEKLVDDLLA 
KVKLVDDILA 
KEKLVDDLLA 
KVKLVDDILA 
KVKLVDDILA 
KVKLVEDILA 



VGIGHSHSNY 
VGIGHSHSNY 
VGIGHSHSNY 
VGIGHSHSNY 
ADK. YTTSDG 
AGK . YTTEDG 
ADK. YTTSDG 
AGK. YTTEDG 
AGK. YTTEDG 
AGK. YTAEDG 



DGYIFNPKDI VEETATAYIV RHGDHFHYIP KSNQIGQPTL 

DSNSSLVSQL VRKVGEGYVF EEKGISRYVF AKD L 

DEK. . LVKEA VRKVGDGYVF EENGVPRYIP AKD L 

DSNSSLVSQL VRKVGEGYVF EEKGISRYVF AKD L 

DEK. .LVKEA VRKVGDGYVF EENGVPRYIP AKD L 

DEK. .LVKEA VRKVGDGYVF EENGVSRYIP AKD L 

DGK. . LVKEA VRKVGDGYVF EENGVSRYIP AKD L 



SLPINPGISH EKHEEDGYGF DANRIIAEDE 
SLPINPGISH EKHEEDGYGF DANRIIAEDE 
SLPINPGTSH EKHEEDGYGF DANRIIAEDE 
SLPINPGTSH EKHEEDGYGF DANRIIAEDE 

KLSKQESVSH T LTAKKE 

KLAKQESLSH K LGAKKT 

KLSKQESVSH T LTAKKE 

KLAKQESLSH K LGAKKT 

KLAKQESLSH K LGAKKT 

KLAKQESLSH K LGTKKT 



SGFIMSHGNH 
SGFIMSHGNH 
SGFVMSHGDH 
SGFVMSHGDH 
N . . . VAPRDQ 
LPSSDR 
VAPRDQ 
LPSSDR 
LPSSDR 
LPSSDR 



EEQIKAAQKH 
EEQIKAAQKH 
EEQIKAAQKH 
EEQIKAAQKH 
TEAHKALFEN 
ARIHQDLLDN 
TEAHKALFXN 
ARIHQDLLDN 
ARIHQDLLDN 
ARIHQDLLDN 



GIMKQYGVKR 
GIMKQYGVKR 
GIMKQYGVKR 
GIMKQYGVKR 
FLAPITHPER 
FLAPIRHPER 
FLAPITHPER 
FLAPIRHPER 
FLAPIRHPER 
FLAPIRHPER 



ELFKPEEGVA 
ELFKPEEGVA 
ELFKPEEGVA 
ELFKPEEGVA 
YIFDEHD. II 
YIFDPRD. IT 
YIFDEHD. II 
YIFDPRD.IT 
YIFDPRD. IT 
YIFDPRD. IT 



LEEVKTSHNG 
LEEVKTSHNG 
LEEVKTSHNG 
LEEVKTSHNG 
. KGRNSDFQA 
. KGRQVDFEA 
. KGRNSDFQA 
.KGRQVDFEA 
.KGRQVDFEA 
. KGRQVDFEA 



ESIWNKEKN 
ESIWNKEKN 
ESIWNKEKN 
ESIWNKEKN 
. LGKPNS 
. LGKPNA 
. LGKPNS 
. LGKPNA 
. LGKPNA 
. LGKPNA 



KKEGNKVYTG 
KKEGNKVYTG 
KKEGNKVYTG 
KKEGNKVYTG 
SDEGD.AYVT 
SDEGD.AYVT 
SDEGD.AYVT 
SDEGD.AYVT 
SDEGD.AYVT 
SDEGD.AYVT 



LDSLSSHEQD 
LDSLSSHEQD 
LDSLSSHEQD 
LDSLSSHEQD 
LDKLLERLND 
LDNLLERLKD 
LDKLLERLND 
LDNLLERLKD 
LDNLLERLKD 
LDNLLERLKD 



AIIYPHGDHH 
AIIYPHGDHH 
AIIYPHGDHH 
AIIYPHGDHH 
QIEYTE. 
QITYTD. 
QIEYTE. 
QITYTD. 
QITYTD. 
QITYTD. 



EELTNWNLL 
EELTNWNLL 
EELTNWNLL 
EELTNWNLL 
PHMGHS . HWI 
PHMTHS . HWI 
PHMGHS . HWI 
PHMTHS . HWI 
PHMTHS . HWI 
PHMTHS. HWI 



YPGNAKEMKD 
YPGNAKEMKD 
YPSNAKEMKD 
YPSNAKEMKD 

EST N 

VSS D 

EST. . . . . .N 

VSS D 

VPS D 

VSS D 



HADPIDEHKP 
HADPIDEHKP 
HADPIDEHKP 
HADPIDEHKP 
. DEVRIAQL 
. DEIQVAKL 
. DEVRIAQL 
.DEIQVAKL 
.DEIQVAKL 
. DEIQVAKL 



KNSTFNNQNF 
KNSTFNNQNF 
KNSTFNNQNF 
KNSTFNNQNF 
GKDSLSDKEK 
KKDSLSEAER 
GKDSLSDKEK 
KKDSLSEAER 
KKDSLSEAER 
KKDSLSEAER 



Q8DQ07 I Q8DQ07_STRR6 
Q6WNQ7 I Q6WNQ7_STRPN 
Q9ANY1 IQ9ANY1 STRPN 



TLANGQKRVS FSFPPELEKK LGINMLVKLI TPDGKVLEKV SGKVFGEGVG 
TLANGQKRVS FSFPPELEKK LGINMLVKLI TPDGKVLEKV SGKVFGEGVG 
TLANGQKRVS FSFPPELEKK LGINMLVKLI TPDGKVLEKV SGKVFGEGVG 
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tr 


Q6WNQ5 


|Q6WNQ5_ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


tr 


Q9AG74 


Q9AG74~ 


JSTRPN 


tr 


Q9AHT9 


Q9AHT9 


JSTRPN 


tr 


Q8DQ08 


Q8DQ08 


JSTRR6 


tr 


Q 9 ANY 2 


Q9ANY2_ 


_STRPN 


tr 


Q 9 ANY 3 


Q 9 ANY 3" 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7" 


_STRPN 


tr 


Q9ANY1 


Q9ANY1 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5_ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


tr 


Q9AG74 


Q9AG7 4~ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


tr 


Q8DQ08 


Q8DQ08~ 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2_ 


_STRPN 


tr 


Q 9 ANY 3 


Q9ANY3_ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7_ 


_STRPN 


tr 


Q9ANY1 


Q9ANY1~ 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2~ 


JSTRR6 


tr 


Q9AG7 4 


Q9AG7 4" 


_STRPN 


tr 


Q9AHT9 


Q9AHT9_ 


JSTRPN 


tr 


Q8DQ08 


Q8DQ08~ 


_STRR6 


tr 


Q9ANY2 


Q9ANY2^ 


_STRPN 


tr 


Q 9 ANY 3 


Q 9 ANY 3^ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7~ 


_STRPN 


tr 


Q9ANY1 


Q9ANY1~ 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2^ 


_STRR6 


tr 


Q9AG74 


Q9AG7 4~ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


tr 


Q8DQ08 


Q8DQ0 8~ 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2~ 


_STRPN 


tr 


Q 9 ANY 3 


Q 9 ANY 3~ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7~ 


_STRPN 


tr 


Q9ANY1 


Q9ANY1~ 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2~ 


_STRR6 


tr 


Q9AG7 4 


Q9AG74_ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


tr 


Q8DQ08 


Q8DQ08_ 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2~ 


_STRPN 


tr 


Q9ANY3 


Q 9 ANY 3^ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7~ 


_STRPN 


tr 


Q9ANY1 


Q9ANYl" 


STRPN 



TLANGQKRVS 
VAAQAYTKEK 
AAAQAYAKEK 
VAAQAYTKEK 
AAAQAYAKEK 
AAAQAYAKEK 
AAAQAY AXE K 



NIANFELDQP 
NIANFELDQP 
NIANFELDQP 
NIANFELDQP 
KRIPLVR. 
KKVPLDR . 
KRIPLVR. 
KKVPLDR. 
KKVPLDR . 
KKVPLDR. 



FSFPPELEKK LGINMLVKLI TPDGKVLEKV 

GILPPSPDAD VKAN PTGDSAAAI 

GLTPPSTDHQ DSGN TEAKGAEAI 

GILPPSPDAD VKAN PTGDSAAAI 

GLTPPSTDHQ DSGN TEAKGAEAI 

GLTPPSTDHQ DSGN TEAKGAEAI 

GLTPPSTDHQ DSGN TEAKGAEAI 



YLPGQTFKYT 
YLPGQTFKYT 
YLPGQTFKYT 
YLPGQTFKYT 
.LP. YMVEHT 
.MP. YNLQYT 
.LP. YMVEHT 
.MP. YNLQYT 
.MP. YNLQYT 
.MP. YNLQYT 



IASKDYPEVS YDGTFTVPTS 
IASKDYPEVS YDGTFTVPTS 
IASKDYPEVS YDGTFTVPTS 
IASKDYPEVS YDGTFTVPTS 

VEVK NGNLIIP. . 

VEVK NGSLIIP. . 

VEVK NGNLIIP. . 

VEVK NGSLIIP. . 

VEVK NGSLIIP. . 

VEVK NGSLIIP. . 



FYPFHAGDTY LRVNPQFAVP 
FYPFHAGDTY LRVNPQFAVP 
FYPFHAGDTY LRVNPQFAVP 
FYPFHAGDTY LRVNPQFAVP 

FAWFDDH TYKAP 

FEWFDEG LYEAP 

FAWFDDH TYKAP 

FEWFDEG LYEAP 

FEWFDEG LYEAP 

FEWFDEG LYEAP 



KGTDALVRVF 
KGTDALVRVF 
KGTDALVRVF 
KGTDALVRVF 
NG. YTLEDLF 
KG. YSLEDLL 
NG. YTLEDLF 
KG. YSLEDLL 
KG. YTLEDLL 
KG. YTLEDLL 



DEFHGNAYLE 
DEFHGNAYLE 
DEFHGNAYLE 
DEFHGNAYLE 
ATIK. . YYVE 
ATVK . . YYVE 
ATIK. .YYVE 
ATVK . . YYVE 
ATVK . . YYVE 
ATVK . . YYVE 



SGKVFGEGVG 
YNRVKGE . 
YNRVKAA. 
YNRVKGE . 
YNRVKAA. 
YNRVKAA. 
YNRVKAA. 



LAYKMASQTI 
LAYKMASQTI 
LAYKMASQTI 
LAYKMASQTI 
. HKDHYHNIK 
. HYDHYHNIK 
. HKDHYHNIK 
. HYDHYHNIK 
. HYDHYHNIK 
. HYDHYHNIK 



NNYKVGEIKL 
NNYKVGEIKL 
NNYKVGEIKL 
NNYKVGEIKL 

HPDER 

HPNER 

HPDER 

HPNER 

HPNER 

HPNER 



PIPKLNQGTT 
PIPKLNQGTT 
PIPKLNQGTT 
PIPKLNQGTT 

. PHSNDG. . 

. PHSDNG. . 

. PHSNDG. . 

. PHSDNG. . 

. PHSDNG. . 

. PHSDNG. . 



RTAGNKI PVT 
RTAGNKI PVT 
RTAGNKI PVT 
RTAGNKI PVT 
. . WGN. . 
. . FGN. . 
. .WGN. . 
. . FGN. . 
. . FGN. . 
. . FGN . . 



FMANAYLDNQ STYIVEVP. . . . ILEKENQT 
FMANAYLDNQ STYIVEVP.. ..ILEKENQT 
FMANAYLDNQ STYIVEVP. . . .ILEKENQT 
FMANAYLDNQ STYIVEVP.. . . ILEKENQT 

. . ASEHVLGK K DHS 

. .ASDHVQRN KNGQ ADTNQT 

. .ASEHVLGK K DHS 

. .ASDHVQRN KNGQADTNQT EKPNEEKPQT 

. . ASDHVRKN K VDQD 

..ASDHVQRN KNGQ ADTNQT 



DKPSILPQFK RNKAQENSKL 
DKPSILPQFK RNKAQENSKL 
DKPSILPQFK RNKAQENLKL 
DKPSILPQFK RNKAQENSKF 

EDP NKNFKA 

EKP NEEKPQ 

EDP NKNFKA 

EKP EEDKEH 

SKP DEDKEH 

EKP SEEKPQ 



DEKVEEPKTS 
DEKVEEPKTS 
DEKVEEPKTS 
DEKVEEPKTS 
DEEPVE. .ET 
TEKPEE 
DEEPVE 
DEVSEP 
DEVSEP 
TEKPEE 



ET 
ET 
TH 
TH 
ET 



EKVEKEKLSE TGNSTSNSTL 
EKVEKEKLSE TGNSTSNSTL 
EKVEKEKLSE TGNSTSNSTL 
EKVEKEKLSE TGNSTSNSTL 

P.AEP 

PREEKP . QSE KPESP 

P.AEP 

PESDEK.ENH VGLNPS.ADN 
PESDEK.ENH AGLNPS.ADN 
PREEKP. QSE KPESP 



EEVPTVDPVQ EKVAKFAESY G.MKLENVLF NMDGTIELYL PSGEVIKKNM 
EEVPTVDPVQ EKVAKFAESY G.MKLENVLF NMDGTIELYL PSGEVIKKNM 
EEVPTVDPVQ EKVAKFAESY G.MKLENVLF NMDGTIELYL PSGEVIKKNM 
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tr 


Q6WNQ5 


Q6WNQ5_ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2_ 


_STRR6 


tr 


Q9AG74 


Q9AG74_ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9_ 


_STRPN 


tr 


Q8DQ08 


Q8DQ08_ 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2_ 


_STRPN 


tr 


Q 9 ANY 3 


Q 9 ANY 3 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7_ 


_STRPN 


tr 


Q9ANY1 


Q9ANY1~ 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5~ 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2" 


~STRR6 


tr 


Q9AG74 


Q9AG74~ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


tr 


Q8DQ08 


Q8DQ08" 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2_ 


_STRPN 


tr 


Q 9 ANY 3 


Q9ANY3_ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07_ 


J3TRR6 


tr 


Q6WNQ7 


Q6WNQ7 


_STRPN 


tr 


Q9ANY1 


Q9ANY1_ 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5_ 


JSTRPN 


tr 


Q8DPQ2 


Q8DPQ2_ 


_STRR6 


tr 


Q9AG7 4 


Q9AG74~ 


JSTRPN 


tr 


Q9AHT9 


Q9AHT9 


JSTRPN 


tr 


Q8DQ08 


Q8DQ08_ 


_STRR6 


tr 


Q 9 ANY 2 


Q9ANY2" 


_STRPN 


tr 


Q 9 ANY 3 


Q9ANY3_ 


_STRPN 


tr 


Q8DQ07 


Q8DQ07 


_STRR6 


tr 


Q6WNQ7 


Q6WNQ7 


JSTRPN 


tr 


Q9ANY1 


Q9ANY1 


_STRPN 


tr 


Q6WNQ5 


Q6WNQ5 


_STRPN 


tr 


Q8DPQ2 


Q8DPQ2 


_STRR6 


tr 


Q9AG74 


Q9AG74~ 


_STRPN 


tr 


Q9AHT9 


Q9AHT9~ 


_STRPN 


tr 


Q8DQ08 


Q8DQ08" 


_STRR6 


tr 


Q 9 ANY 2 


Q 9 ANY 2 


_STRPN 


tr 


Q 9 ANY 3 


Q9ANY3_ 


_STRPN 



EEVPTVDPVQ EKVAKFAESY G.MKLENVLF 

E VPQVET 

. . KPTEEPEE ESPEESPE.. . ESEEPQVET 

E VPQVET 

LYKPSTDTEE . TEEEAEDTT DEAEIPQVEH 
LYKPSTDTEE .TEEEAEDTT DEAEIPQVEN 
. . KPTEEPEE . SPEESEE PQVET 



ADFTGEAPQG 
ADFTGEAPQG 
ADFTGEAPQG 
ADFTGEAPQG 
TDSS. . LKAN 
QNPI . . IKSN 
TDSS. .LKAN 



TDSS, 
TDPS, 
QDPI . 



, IRQN 
, IRQN 
IKSN 



NGENKPSENG 
NGENKPSENG 
NGENKPSENG 
NGENKPSENG 
ATETLAGLRN 
AKETLTGLKN 
ATETLAGLRN 
AVETLTGLKS 
AMETLTGLKS 
AKETLTGLKN 



KVSTGTVENQ 
KVSTGTVENQ 
KVSTGTVENQ 
KVSTGTVENQ 
NLTLQIMDNN 
NLLFGTQDNN 
NLTLQIMDNN 
SLLLGTKDNN 
SLLLGTKDNN 
NLLFGTQDNN 



NMDGTIELYL 
. EKVEAQL 
. EKVKEKL 
. EKVEAQL 
.SVINAKI 
. SVINAKI 
. EKVEEKL 



PTENKPADSL 
PTENKPADSL 
PTENKPADSL 
PTENKPADSL 
.SIMMAEKL 
. TIMAEAEKL 
. SIMAEAEKL 
. TISAEVDSL 
. TISAEVDSL 
.TIMAEAEKL 



PSGEVIKKNM 
KEAEVLLAKV 
REAEDLLGKI 
KEAEVLLAKV 
AEAEALLEKV 
ADAEALLEKV 
REAEDLLGKI 



PEAPNEKPVK 
PEAPNEKPVK 
PEAPNEKPVK 
PEAPNEKPVK 



LALLKGS . 
LALLKES . 
LALLKGS . 
LALLKES. 
LALLKES . 
LALLKES . 



.N 
.K 
.N 
• Q 
-Q 
.K 



PENSTDNGML NPEGNVGSDP MLDPALEEAP AVDPVQEKLE KFTASYGLGL 
PENSTDNGML NPEGNVGSDP MLDPALEEAP AVDPVQEKLE KFTASYGLGL 
PENSTDNGML NPEGNVGSDP MLDPALEEAP AVDPVQEKLE KFTASYGLGL 
PENSTDNGML NPEGNVGSDP MLDPALEEAP AVDPVQEKLE KFTASYGLGL 
PSSVSKEKIN 

PSSVSKEKIN 

PTPIQ 

PAPIQ 



DSVIFNMDGT IELRLPSGEV IKKNLSDLIA 

DSVIFNMDGT IELRLPSGEV IKKNLSDLIA 

DSVIFNMDGT IELRLPSGEV IKKNLSDLIA 

DSVIFNMDGT IELRLPSGEV IKKNLSDLIA 
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Alignments 



tr Q8PQ07 Pneumococcal histidine triad protein E [phtE] [Streptococcus 103 9 AA 
pneumoniae (strain ATCC BAA- 255 / R6)] 

align 

Score = 2017 bits (5225), Expect = 0.0 

Identities = 1004/1039 (96%), Positives = 1004/1039 (96%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbjct: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVXIGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

QKEGIQAEQI VI KI TDQGYVTSHGDHYHY YNGKVPYDALFSEELLMKDPNYQLKDAD I VN 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYD^ 12 0 

Query: 121 EVKGGYI IKVDGKYYWLKDAAHADNVRTKDEINRQKQEHVKDNEKWSN^ 180 

EVKGGYIIKVDGKYYVYLKDAAHADNVRTKM^ 
Sbjct: 121 EVTCGGYIIKVDGKYYWLKDAAHADNVRTKDEIN^^ 180 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

TTNDGYVFNPADI IEDTGNAYIVPHGGHYHYI P NMQPSQLS 
Sbjct: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 3 01 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXX 360 

TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEW 
Sbjct: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

KELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

Query: 4 81 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct: 601 WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 
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Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 7 80 

GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAyLENNYKVGEIKLPIPKIjNQGTTRTAGN'K 
Sbjct: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 7 80 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKIWKAQENSKLDEKVEEPKTS 840 

IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSIIiPQFKRNKAQENSKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
Sbjct: 841 BKV^KEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGM 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 



tr S$MS1 Surface protein BVH-3 [bvh-3] [Streptococcus 103 9 AA 

pneumoniae] align 

Score = 2017 bits (5225), Expect =0.0 

Identities = 1004/1039 (96%), Positives = 1004/1039 (96%) 

Query: 1 MKFS KKY I AAGS AVI VS LSLCAYALNQHRS QENKDNNRVS YVDGS QS S QKS ENLTPDQVS 60 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbjct: 1 MKFS KKY IAAGSAVIVSLSLCAYALNQHRSQENIO)NNRVS YVDGS QSS QKS ENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

Query: 121 EVKGGYIIKVDGKYYWLKDAAHADNVRTKDEINRQKQ 1 80 

EVKGGYI IKVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRY 
Sbjct: 121 EV7CGGYIIKVDGKYYVTLKDAAHADNWTKDEINRQKQEHVKDNEKW 18 0 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

TTNDGYVFNPAD I IEDTGNAYI VPHGGHYHYI P NMQPSQLS 
Sbjct: 181 TTNDGYVFNP AD I IEDTGNAY I VPHGGH YHY I PKSDLS ASELAAAKAHLAGKNMQP SQLS 240 



Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 3 01 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXX 360 

TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEW 
Sbjct: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 
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KELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGPDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGWKVYTGEELTNV 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 WLLKKSTFNNQNFTI^^GQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VNLLKMSTFNWQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct: 601 VNLLKNSTFNNQNFTZiANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

Query; 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGE IKLP IPKLNQGTTRTAGNK 
Sbjct: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLP I PKLNQGTTRTAGNK 780 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS B40 

IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSIIjPQFKRNKAQENSKLDEKVEEPKTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLEWVLFNMDGTIELYLP 
Sbjct: 841 EKVEKEICLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVTFNMDGTI 
Sbjct : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 103 9 



tr Q9ANY1 Pneumococcal hietidine triad protein E precursor 1039 
(Hypothetical ^ 
protein SP1004) [phtE] [Streptococcus pneumoniae] align 

Score = 2006 bits (5196), Expect =0.0 

Identities = 998/1039 (96%), Positives = 1000/1039 (96%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 
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MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbjct: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGyVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIW 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

Query: 121 E VKGG YI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARS QGRY 18 0 

EVKGGYIIKVDGKYYVYLKDAAHADWVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 
Sbjct: 121 EVKGGYI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARS QGRY 180 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

TTNDGYVFNPADI IEDTGNAYIVPHGGHYHYIP NMQPSQLS 
Sbjct: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 3 01 TPNG VAI PHGDH YHF I P YS KLS ALEEKI ARRVPI SGTGS T VSTNAKPNEWXXXXXXXXX 3 60 

TPNGVAIPHGDHYHFI PYSKLSALEEKIAR VPISGTGSTVSTNAKPNEW 
Sbjct: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPI SGTGS TVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSS ASDGY I FNPKD I VEETATAYI VRHGDHFH YI PKSNQ IGQPTLPNNS LA 42 0 

KELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PS SLTTSKELSSASDGYIFNPKD I VEETATAYI VRHGDHFHYIPKSNQIGQPTLPNNS LA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

TPSPSLPINPG SHEKHEEDGYGFDANRI IAEDESGF+MSHG+HNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKA 480 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYP NAKEMKDLDKKIE.EKIAGIMKQYGVKRES I WN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGN I ANFE LDQP YLPGQTFKYT I AS KD YPE VS YDGTFTVPTSLAYKMASQT I F YPFHA 72 0 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbjct: 661 EGVGN I ANFE LDQP YLPGQTFKYT I AS KD YPE VS YDGTFTVPTSLAYKMASQT IF YPFHA 720 

Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGE I KLP IPKLNQGTTRTAGNK 780 

GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 
Sbjct: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

I PVTFMANA YLDNQSTY I VE VP I h EKENQTD KPS I LPQFKRNKAQEN KLDEKVEEPKTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
Sbjct: 841 EKVEKEKLS ETGN STSNSTLEE VPTVDPVQEKVAKFAES YGMKLENVLFNMDGT I EL YLP 900 
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Query; 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct: 901 SGEVIKKNMABFTGEAPQGNGENKPSEN^ 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKPTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 



tr .Q6WNQ5 Surface protein BVH-3 (Fragment) [bvh-3] [Streptococcus 1019 
pneumoniae) ^ 

align 

Score = 1968 bits (5099), Expect =0.0 

Identities = 977/1019 (95%), Positives = 979/1019 (95%) 

Query: 21 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 80 

CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKTTDQGYV 
Sbjct: 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

Query: 81 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYV^LKD 14 0 

TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 
Sbjct: 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 12 0 

Query: 141 AAHADNVT^TKDE INRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPAD I IEDTGNA 200 

AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 
Sbjct: 121 AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPAD I IEDTGNA 180 

Query: 201 YIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSK 260 

YIVPH GHYHYIP NMQPSQLSYSSTASDNNTQSVAKGSTSK 
Sbjct: 181 YIVPHRGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

Query: 261 PANKSENLQSLLKELYDSPS AQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSK 320 

PANKSENLQSLLKELYDSPS AQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSK 
Sbjct: 241 PANKSENLQSLLKELYDSPS AQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSK 300 

Query: 321 LSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFN 380 

LSALEEKIAR VPISGTGSTVSTNAKPNEW KELSSASDGYIFN 
Sbjct: 301 LS ALE E KI ARMVP I S GTGS TVS TNAKPNEWSSLGSLSSNPSSLTTS KELSSASDGYIFN 360 

Query: 381 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEED 440 

PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG SHEKHEED 
Sbjct; 361 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420 

Query: 441 GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 500 

GYGFDANRIIAEDESGF+MSHG+HNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
Sbjct: 421 GYGFDANR 1 1 AEDESGFVMSHGDHNH YFFKKDLTEEQI KAAQKHLEEVKTSHNGLDSLS S 480 

Query: 501 HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPHGDHHHADPID 560 

HEQDYP NAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 
Sbjct: 481 HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 540 
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Query: 561 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNVVNLLKNSTFNNQNFTLANGQ 620 

EHKPVGIGHSHSNTYELFKPEEGVAKKEGNKVYTGEELTNVWL^ 
Sbjct: 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTO^ 600 

Query: 621 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 680 

KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 
Sbjct: 601 KRVSFSFPPELEKKIX5INMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

Query: 681 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 740 

FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 
Sbjct: 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 720 

Query: 741 VRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE 800 

VRVFDE FHGNAYLENN YKVG E I KLP I PKLNQGTTRTAGNKI P VTFMANAYLDNQST Y I VE 
Sbjct: 721 VRVFDE FHGNAYLENNYKVGE I KLP I PKLNQGTTRTAGNKI PVTFMANAYLDNQSTY I VE 780 

Query: 801 VPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 860 

VPI LEKENQTDKPS ILPQFKRNKAQENS K DEKVEEPKTSEKVEKEKLSETGNSTSNSTL 
Sbjct: 781 VPI LEKENQTDKPS ILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

Query: 861 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 92 0 

EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 
Sbjct: 841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNI^GTIELYLPSGEVIKKNMADFTGEAPQGN 900 

Query: 921 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 98 0 

GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 
Sbjct: 901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

Query: 981 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1039 

LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 
Sbjct: 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 1019 



tr QL?WR4 Histidine Motif -Containing protein [phpA] [Streptococcus 
pneumoniae 

(strain ATCC BAA-255 / R6)] 

Score = 442 bits (1137), Expect = e-122 

Identities = 219/369 (59%), Positives = 271/369 (73%), Gaps = 21/369 (5%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+A A +V LS+CAY L H++Q K+NNRVSY+DG Q++QK+ENLTPD+VS 
Sbjct: 12 MKINKKYLAGS VATLV - LS VCAYELGLHQAQTVKENNRVS YIDGKQATQKTENLTPDE VS 70 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

++EGI AEQI VIKI TDQGYVTSHGDHYHYYNGKVP YDA+ SEELLMKDPNYQLKD DI++ 
Sbjct: 71 KREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAI ISEELLMKDPNYQLKDEDI IS 130 

Query: 121 EVKGGYI IKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNE KVNSNV AVARS 176 

E+KGGY+ IKVDGKYYVYLKDAAHADNVRTK+EINRQKQEH + E + + VA+ARS 
Sbjct: 131 EIKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARS 190 

Query: 177 QGRYTTNDGYVFNPAD 1 1 EDTGNAY I VPHGGHYHY IPXXXXXXXXXXXXXXXXXXXNMQP 236 

QGRYTT+DGY +FN +DIIEDTG+AYIVPHG HYHYIP 
Sbjct: 191 QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAKAFLSGRGNLS 250 
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Entry name 

Primary accession number 
Secondary accession number 
Entered in TrEMBL in 
Sequence was last modified in 
Annotations were last modified in 
Name and origin of the protem 
Protein name 
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Gene name 



Q9 ANY 1_STRPN 
Q9ANY1 

Q7D4B6 

Release 17, June 2001 
Release 17, June 2001 
Release 30, May 2005 



Pneumococcal histidine triad protein E [Precursor] 
Hypothetical protein SP1004 

Name: phtE 

OrderedLocusNames: SP1004 
From Streptococcus pneumoniae [TaxID: 1313] 

Taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; 

Streptococcus. 

References 
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DOI= 10. 1126/science. 1061217; PubMed=l 1463916 [NCBI, ExPASy, EBI, Israel, Japan] 
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Keywords 
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AE007403; AAK75121.1; -; 
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Get region on 2D PAGE. 
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SIGNAL 1 29 29 Potential. 



Sequence information 

Length: 1039 Molecular weight: 114631 CRC64: 81A563FC806625C4 [This is a checksum on the 
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YIPKSDLSAS 


ELAAAKAHLA 


GKNMQPSQLS 


250 


2 6_0 


27.0 


2 8.0 


290 


300 


YSSTASDNNT 


QSVAKGSTSK 


PANKSENLQS 


LLKELYDSPS 


AQRYSESDGL 


VFDPAKIISR 


310 


320 


330 


340 


350 


360 


TPNGVAIPHG 


DHYHFIPYSK 


LSALEEKIAR 


MVPISGTGST 


VSTNAKPNEV 


VSSLGSLSSN 


37.0 


38J0 


390 


40.0 


410 


42.0 


PSSLTTSKEL 


SSASDGYIFN 


PKDIVEETAT 


AYIVRHGDHF 


HYIPKSNQIG 


QPTLPNNSLA 


430 


44_0 


450 


460 


470 


480 
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TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS HGDHNHYFFK KDLTEEQIKA 

490 500 510. 520 530 540 

AQKHLEEVKT SHNGLDSLSS HEQDYPSNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN 

550 560 510 580 590 600 

KEKNAIIYPH GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 

610 620 630 640 650 660 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV. LEKVSGKVFG 

670 680 690 700 710 720 

EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT VPTSLAYKMA SQTIFYPFHA 

730 740 750 76CI 770 780 

GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN AYLENNYKVG EIKLPIPKLN QGTTRTAGNK 

790 BOO 810 $20 830 840 

IPVTFMANAY LDNQSTYIVE VPILEKENQT DKPSILPQFK RNKAQENLKL DEKVEEPKTS 

850 86_0 870 88_0 890 900 

EKVEKEKLSE TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 

910 920 930 940 95_0 960 

SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP EAPNEKPVKP 

970 980 99,0 1000 1010 1020 

ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK FTASYGLGLD SVIFNMDGTI 

1030 

ELRLPSGEVI KKNLSDLIA Q9ANY1 in FASTA 

format 



View entry in original UniProtKB/TrEMBL format 

View entry in raw text format (no links) 

Request for annotation of this UniProtKB/TrEMBL entry 



BLAST submission on 
BLAST ExPASy/SIB 

oratNCBI (USA) 



ScanProsite, MotifScan 




NPSA Sequence analysis 
tools 





Sequence analysis tools: ProtParam, ProtScale, 
Compute pI/Mw, PeptideMass, PeptideCutter, 
Dotlet (Java) 

Submit a homology modeling request to SWISS- 
MODEL 



ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot 
Hosted by Hi NHRI Taiwan Mirror sites: Australia Brazil Canada Korea Switzerland USA 
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tr Q 9 ANY A. Pneumococcal histidine triad protein E precursor 1039 

Q9ANY1_STRPN (Hypothetical AA 

protein SP1004) [phtE] [Streptococcus pneumoniae] align 



Score = 2006 bits (5196), Expect =0.0 

Identities = 998/1039 (96%), Positives = 1000/1039 (96%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

NlKFSKKYIAAGSAyiVSLSLCAYJ^N 
Sb j ct : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

QKEGICtfU£qiVIKITDQGYVTSHG^ 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADhT^RTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

EVKGGYIT.ir^GKYYY^^^ 

Sbjct: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 240 

T TND G Y V FN P A D 1 1 E YjT GN A Y I V PK GGH Y H Y I P NMQVSQLS 
Sbjct: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YB8TASDNNTQ3VAKGST3KPANKSEN 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 301 TPNGVAI PHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXX 360 

T PNG VAX PHGDK YHFXPY3 KLSALE EKIAR VPISGTGSTVS TNAK PNEVV 
Sbjct: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

K5 L 3 S AS D GY I FN PKD X VE E TAT AY I VRH GDH FK Y I PK S NQ I GQ PT L PMN S I: A 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

TF3PSLPINFG SKEKHEEDGYGFDANRI I AEDE 3 GF+MSHG-i-KNHY FFKKDLTSEQTKA 
Sbjct: 421 TPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKA 48 0 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYP NAKEMKD1:DKK;:EEKIAG1HKQYGVKRESIVV7^ 
Sbjct : 481 AQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KE KN A A' I Y PH GDHHH A D .P X DE H K PV GX GH S H S K r Y E L FK P E E GVAKKE GKK V Y T GE E L7;N V 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VNLLKNSTFNNQKFTLA^GQKRVSFSFPPELEKKLGINMLVKLXTPDGKVLEKVSGKVFG 
Sbjct: 601 VNLLKNST FNNQNFT LANGQKRVS FS FPPE LEKKLGI NMLVKLI T PDGKVLEKVS GKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

E GVGH I AMFE LDQ F Y 7.i PGQT FKY T IAS KD Y P E V S Y DGT FT V PT S LAY KMAS QT I F Y ? FH A 
Sbjct: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 78 0 

GDT Y LRVNFQ FAV PKGT DALVR VFDE FKGNA Y LE MM Y KYGE I K T.r ? I PKLNQGT TRT AGMK 
Sbjct : 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 78 0 
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Query: 781 IPVTFM7\NAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

I PVT HTyfANAYLDNQSTY I VEVPI LF;KBNQTDK?S X LPQFKP.NKAQHIN KLDEKVSEPKTS 
Sbjct : 781 IPVTFM7\NAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

Query: 8 41 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Sbjct: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

S G& V X KK'KMAl; PTGE A PQ GjM GENK P 8 E M GKVS T GT V & N Q PT L PE APN & KP VK P 

Sbjct : 901 SGEVIKKNM7\DFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 102 0 

S N S T DNG.MLN P£ GNV GS T> PMLD :VAL £ F. A P AVD PV QE. KLEJ KFT. A S Y GL GLD S V I V'NMDGT I 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

£ LR L P S GE VI KKN L 3 D L 1 A 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 
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tr Pneumococcal histidine triad protein E [phtE] 1039 

Q8DQ07_STRR6 [Streptococcus AA 

pneumoniae (strain ATCC BAA-255 / R6) ] align 



Score = 2017 bits (5225), Expect =0.0 

Identities = 1004/1039 (96%), Positives = 1004/1039 (96%) 

query: 1 mkfskkyiaagsavivslslcayalnqhrsqenkdnnrvsyvdgsqssqksenltpdqvs 60 

mk fs kk y i aags av i v s ls lc ay aln qkr s qen rf-nnr vs y v d g s 0 3 5 qk3 en lt pdqvs 
sb j ct : 1 mkeskkyiaagsavivslslcayalnqhrsqenkdnnrvsyvdgsqssqksenltpdqvs 60 

query: 61 qkegiqaeqivikitdqgyvtshgdhyhyyngkvpydalfseellmkdpnyqlkdadivn 120 

qkegiqaeqi vikitdqgyvtshgdkyhyyngkvpy 
sb jet : 61 qkegiqaeqivikitdqgyvtshgdhyhyyngpcvpydalfseellmkdpnyqlkdadivn 120 

query: 121 evkggyiikvdgkyyvylkda7\hadnvrtkdeinrqkqehvkdnekvnsnvavarsqgry 180 

3 vkggy 1 1 kv dgky yvy lkdaahadnvrtkde i nrqkqe}{vkdnekvnsnvavarsqgry 
sbjct: 121 evkggyiikvdgkyyvylkdaahadnvrtkdeinrqkqehvkdnekvnsnvavarsqgry 180 

query : 181 ttndgyvfnpadi iedtgnayi vphgghyhy i pxxxxxxxxxxxxxxxxxxxnmqpsqls 240 

ttndgy v fn pad i i e dt gn a i x v phggh y h y i p nmqpsqls 
sbjct: 181 ttndgyvfnpadi i edtgnay i vphgghyhy i pksdlsaselaaakahlagknmqpsqls 240 

query: 241 ysstasdnntqsvakgstskpanksenlqsllkelydspsaqrysesdglvfdpakiisr 300 

'{. b 3 t ao dnnt q8 v akg s t s k panks & n lq 5? l lk£ l y os p s y s f. s d g l v fd par' i .1 s r 
sbjct: 241 ysstasdnntqsvakgstskpanksenlqsllkelydspsaqrysesdglvfdpakiisr 300 

query: 301 tpngvaiphgdhyhfipysklsaleekiarrvpisgtgstvstnakpnewxxxxxxxxx 360 

tpngv7^iphgdhyhfipysklsalseki;^rvpisgtgstvstn;\kpnevv 
sbjct : 301 tpngvaiphgdhyhfipysklsaleekiarrvpisgtgstvstnakpnewsslgslssn 360 

query: 361 xxxxxxxkelssasdgyifnpkdiveetatayivrhgdhfhyipksnqigqptlpnnsla 420 

ke l s s as d gy i fn p kd i ve f : tat ay i vrh gdh f h y i p ks nq i gq pt l pnn s la 
sbjct: 361 psslttskelssasdgyifnpkdiveetatayivrhgdhfhyipksnqigqptlpnnsla 420 

query: ,421 tpspslpinpgishekheedgygfdanriiaedesgfimshgnhnhyffkkdlteeqika 480 

tps ps lpinfgi shekh3edgy gfdanri i af;df; sgfimshgnt-inh y ffi<kdltef:qi ka 
sbjct: 421 tpspslpinpgishekheedgygfdanriiaedesgfimshgnhnhyffkkdlteeqika 480 

query: 481 aqkhleevktshngldslssheqdypgnakemkdldkkieekiagimkqygvkresiwn 540 

aqkhl^vivktsiingldsx:ssh^qdypgnake^kdi:dkkxefkiagij^kqy gv'kk^stvv'n 
sbjct : 481 aqkhleevktshngldslssheqdypgnakemkdldkkieekiagimkqygvkresiwn 540 

query: 541 keknaiiyphgdhhhadpidehkpvgighshsnyelfkpeegvakkegnkvytgeeltnv 600 

k& kn a.y i. v pkgdkhradpx dskkpvg7. - gksrs k y.& lpxpe.£ gvakkfgk\kv ytgfp.ltnv 
sbjct: 541 keknai i y phgdhhhadpi dehkpvgi ghshsnyelfkpeegvakkegnkvytgeeltnv 600 

query: 601 vnllknstfnnqnftlangqkrvsfsfppelekklginmlvklitpdgkvlekvsgkvfg 660 

v n l lkns t fn nqn ft lan gqkr vs f s f p pe le kklg i nml v kl i t pd gkv l e k vs gkv f g 
sbjct : 601 vnllknstfnnqnftlangqkrvsfsfppelekklginmlvklitpdgkvlekvsgkvfg 660 

query: 661 egvgnianfeldqpylpgqtfkytiaskdypevsydgtftvptslaykmasqtifypfha 720 

e gvgni an fe ldq fy l pgq? fkyt iaskd y ?e vs ydgt ftvpt s lay kmas qt i fy ? fha 
sbjct: 661 egvgnianfeldqpylpgqtfkytiaskdypevsydgtftvptslaykmasqtifypfha 720 

query: 721 gdtylrvnpqfavpkgtdalvrvfdefhgnaylennykvgeiklpipklnqgttrtagnk 7 80 

gdt y lrvnfq f7^v pkgtdalvrvfds fkgha y lsnn ykvge i kl p i pk.lnqgttrt'agnk 
sbjct : 721 gdtylrvnpqfavpkgtdalvrvfdefhgnaylennykvgeiklpipklnqgttrtagnk 780 
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Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

IFVTE^TANAYLDKQSTYIV^ 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLEWLFNMDGTIELYLP 900 

Sbjct: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Sbjct : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

£ LRL P S C-E V 1 KKN L3 DL 1 A 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 
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CLUSTAL W (1.74) multiple sequence alignment 



tr | Q9ANY1 1 Q 9ANY 1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr IQ6WNQ5 IQ6WNQ5 STRPN 



MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK 
MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK 
MKFSKKYIAAGSAVIVSLSLCAYALNOHRSQENKDNNRVSYVDGSOSSQK 
CAYALNQHRSQENKDNNRVSYVDGSQSSQK 

SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF 
SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF 
SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF 
SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



S EE LLMKDPNYQLKDAD I VNE VKGGY 1 1 KVDGKY YVY LKDAAHADNVRTK 
S EE LLMKDPNYQLKDAD I VNE VKGGY 1 1 KVDGKY YVY LKDAAHADNVRTK 
SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDA71HADNVRTK 
SEE LLMKD PNYQLKDAD I VNE VKGGY 1 1 KVDGKY YVY LKDAAHADNVRTK 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 
DE I NRQKQE HVKDNE KVNSNVAVAR S Q GR YTTNDGY V FNPAD HE DTGNA 
DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 
DE INRQKQE HVKDNE KVNSNVAVARSQGRYTTNDGYVFNPADI I EDTGNA 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT 

YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT 

YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT 

YIVPHRGHYHYIPKSDLSASEL7WVKAHLAGKNMQPSQLSYSSTASDNNT 
******************************************** 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 

QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 

QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 

QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
************************************************** 



tr | Q9ANY1 1 Q 9AN Y 1_S TR PN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV 

TPNGVAI PHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEV 

TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEV 

TPNGVAI PHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV 
****************************** ******************* 



tr | Q9ANY1 | Q9ANY1JSTRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF 
VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF 
VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF 
VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
trtQ6WNQ5|Q6WNQ5 STRPN 



HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII 

HYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRII 

HYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRII 

HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII 

★★★★★★★★★★★★★★★★★★ 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr|Q6WNQ7|Q6WNQ7 STRPN 



AEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
AEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
AEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
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tr|Q6WNQ5|Q6WNQ5 STRPN 



AEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
******* - **** - ************************************* 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 
tr 



tr 
tr 
tr 



Q9ANY1 1 Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 i Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 1 Q9ANY1__STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 IQ6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 I Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 |Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 IQ6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 IQ6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 |Q8DQ07_JSTRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q 9 ANY 1_STR PN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7 STRPN 



HEQDY PSNAKEMKDLDKKI EEKIAGIMKQYGVKRES I WNKEKNAI I YPH 
HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPH 
HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPH 

HEQD Y PSNAKEMKDLDKKI E EKIAGIMKQYGVKRES I AAANKEKNAI I YPH 

★ *•*★★★★*★★★★★★★★*★★★*★★★★*★★★★★★★★★*★★★★★★★★ 

GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
GDHHH7\DPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 

★ ★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★it*** 



VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKV 
VNLLKNS T FNNQNFT LANGQKRVS FS F P PE LEKKLGI NMLVKL I TPDGKV 
VNLLKNSTFNNQNFTL7\NGQKRVSFSFPPELEKKLGINMLVKLITPDGKV 
VNLLKNS T FNNQNFT LAN GQKRVSFSFPPE LEKKLGI NMLVKL I TPDGKV 

LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT 
LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT 
LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT 
LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT 



V PT S LAY KMAS QT I F Y P FHAGDT Y LR VN PQ FAV PKGTDALVR VFDE FHGN 
VPTS LAYKMAS QT I FY PFHAGDT Y LRVNPQFAVPKGTDALVRVFDE FHGN 
VPTS LAYKMAS QT I FY PFHAGDTY LRVNPQ FAV PKGTDALVR VFDE FHGN 
VPTS LAYKMAS QT I FY P FHAGDT YLRVNPQ FAV PKGTDALVRVFD E FHGN 

AYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE 
AY LENNYKVGE I KLPI PKLNQGTTRTAGNKI PVT FMANAY LDNQST Y I VE 
AY LENNY KVGE I KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTY IVE 
AYLENNYKVGEIKLPI PKLNQGTTRTAGNKI PVT FMANAY LDNQST Y I VE 

VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSE 
VPI LEKENQTDKPS I LPQFKRNKAQENSKLDEKVEE PKTSEKVEKEKLSE 
VPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSE 
VPI LEKENQTDKPS I LPQFKRNKAQENSKFDEKVEE PKTSEKVEKEKLSE 

★ . ******************** 

TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
************************************************** 



SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP 
SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP 
SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP 
SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP 
********* *****★★**★*★★ ************** ****★****★★★★* 

EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 
EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 
EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 
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tr | Q6WNQ5 | Q6WNQ5_STRPN EAPNEKPVK PENS TDNGMLNPEGNVGSDPMLD PALE EAPAVDPVQEKLEK 

tr | Q9ANY1 | Q9ANY1_STRPN FTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 

tr | Q8DQ07 | Q8DQ07_STRR6 FTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 

t r | Q6WNQ7 | Q6WNQ7JSTRPN FTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 

tr | Q6WNQ5 | Q6WNQ5_STRPN FTASYGLGLDSVI FNMDGTIELRLPSGEVIKKNLSDLIA 

★ ★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★a-**** 
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PileUp 



MSF: 1039 Type: P 



Check: 



Name: tr | Q9ANY1 | Q9ANY1_STRPN oo 

Name: tr | Q8DQ07 | Q8DQ07_STRR6 oo 

Name: tr | Q6WNQ7 | Q6WNQ7_STRPN oo 

Name: tr | Q6WNQ5 | Q6WNQ5 STRPN oo 



// 



304 

Len: 
Len : 
Len: 
Len: 



1039 Check: 9358 Weight: 0.100 

1039 Check: 8867 Weight: 0.100 

1039 Check: 8867 Weight: 0.100 

1039 Check: 3212 Weight: 0.100 



tr | Q9ANY1 | Q9ANY 1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5_STRPN 



MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
CAYALNQHRS QENKDNNRVS YVDGSQSSQK 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07 JSTRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 

SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 

SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 

SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr IQ6WNQ5 IQ6WNQ5 STRPN 



SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 



tr | Q9ANY1 | Q 9 AN Y 1_S T R PN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 

DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 

DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 

DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 



tr | Q9ANY1 1 Q 9 AN Y 1 _S T R PN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr IQ6WNQ5 IQ6WNQ5 STRPN 



YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 

YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 

YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 

YIVPHRGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr IQ6WNQ5 [Q6WNQ5 STRPN 



QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 

QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 

QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 

QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7J5TRPN 
tr|Q6WNQ5|Q6WNQ5 STRPN 



TPNGVAIPHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR RVPISGTGST VSTNAKPNEV 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR RVPISGTGST VSTNAKPNEV 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7 STRPN 



VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 
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tr|Q6WNQ5IQ6WNQ5 STRPN 



VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 



Q9ANY1 | Q9ANY 1_STRPN 
Q8DQ07 t Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY 1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 IQ6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7J3TRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 IQ6WNQ5 STRPN 



Q9ANY1 | Q9ANY 1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 | Q6WNQ7_STRPN 
Q6WNQ5 | Q6WNQ5 STRPN 



Q9ANY1 | Q9ANY1_STRPN 
Q8DQ07 | Q8DQ07_STRR6 
Q6WNQ7 IQ6WNQ7 STRPN 



HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII 

HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GISHEKHEED GYGFDANRII 

HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GISHEKHEED GYGFDANRII 

HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII 



AEDESGFVMS HGDHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 

AEDESGFIMS HGNHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 

AEDESGFIMS HGNHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 

AEDESGFVMS HGDHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 



HEQDYPSNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAIIYPH 

HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAIIYPH 

HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAIIYPH 

HEQDYPSNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAIIYPH 



GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 

GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 

GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 

GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 



VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 



LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 

LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 

LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 

LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 



VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 

VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 

VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 

VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 



AY LENNY KVG EIKLPIPKLN QGTTRTAGNK IPVTFMANAY LDNQSTYIVE 

AY LENNY KVG EIKLPIPKLN QGTTRTAGNK IPVTFMANAY LDNQSTYIVE 

AY LENNY KVG EIKLPIPKLN QGTTRTAGNK IPVTFM7\NAY LDNQSTYIVE 

AY LENNY KVG EIKLPIPKLN QGTTRTAGNK IPVTFMANAY LDNQSTYIVE 



VPILEKENQT DKPSILPQFK RNKAQENLKL DEKVEEPKTS EKVEKEKLSE 

VPILEKENQT DKPSILPQFK RNKAQENSKL DEKVEEPKTS EKVEKEKLSE 

VPILEKENQT DKPSILPQFK RNKAQENSKL DEKVEEPKTS EKVEKEKLSE 

VPILEKENQT DKPSILPQFK RNKAQENSKF DEKVEEPKTS EKVEKEKLSE 



TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 
TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 
TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 
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tr|Q6WNQ5|Q6WNQ5 STRPN TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr i Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 

SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 

SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 

SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 



tr | Q9ANY1 1 Q 9 ANY 1_S T R PN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5 STRPN 



EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 

EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 

EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 

EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 



tr | Q9ANY1 | Q9ANY1_STRPN 
tr | Q8DQ07 | Q8DQ07_STRR6 
tr | Q6WNQ7 | Q6WNQ7_STRPN 
tr | Q6WNQ5 | Q6WNQ5_STRPN 



FTASYGLGLD SVIFNMDGTI 

FTASYGLGLD SVIFNMDGTI 

FTASYGLGLD SVIFNMDGTI 

FTASYGLGLD SVIFNMDGTI 



ELRLPSGEVI KKNLSDLIA 

ELRLPSGEVI KKNLSDLIA 

ELRLPSGEVI KKNLSDLIA 

ELRLPSGEVI KKNLSDLIA 
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